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1.  INTRODUCTION 

Systemic  sclerosis  (SSc)  is  a  heterogeneous  disease  of  fibrosis  and  inflammation,  concomitant  with  significant 
autoimmunity.  SSc  often  presents  with  skin  manifestations  and  Raynaud’s  phenomenon;  the  extent  and 
location  of  fibrotic  lesions  in  people  with  SSc  contributes  to  the  diagnoses  of  disease  subtypes  and  prognosis. 
My  laboratory  has  pioneered  the  use  of  gene  expression  subsets  in  SSc  [2-61.  Most  recently  we  have 
demonstrated  enrichment  of  a  mycobiome  component  (Rhodotorula  glutinis)  in  SSc  patient  skin  [7]. 

We  describe  our  studies  from  the  first  year  of  the  grant  below.  This  work  was  accomplished  by  researchers  at 
Geisel  School  of  Medicine  at  Dartmouth,  Boston  University  Medical  Center  and  University  of  California,  San 
Francisco  under  the  partnering  PI  option. 

2.  KEYWORDS: 

IMSA,  systemic  sclerosis,  scleroderma,  SSc,  mycobiome,  microbiome,  fibrosis,  gene,  genetics,  RNA-seq,  Next 
Generation  Sequencing,  skin,  R.  glutinis,  Rhodotorula,  Metagenomics 

3.  ACCOMPLISHMENTS 

Milestones  were  assigned  to  this  proposal,  with  tasks  to  be  accomplished  by  each  investigator.  The  overall 
summary  of  our  progress  relative  to  these  tasks  is  given  below,  followed  by  a  complete  discussion  of  our  work 
this  past  year. 

Milestone  1  Determine  the  identity  and  distribution  of  microbiome  components  across  SSc  skin. 

Task  1  (Months  1-36)  Whitfield  Laboratory  to  perform  RNA-seq  analysis  of  SSc  skin  biopsies. 

Including  technical  replicates,  RNA-seq  has  been  run  on  22  SSc  patient  skin  biopsies  and  6  healthy 
controls  to  date;  an  additional  8  SSc  patient  biopsies  are  currently  being  processed.  Recruitment  of 
additional  SSc  patients  and  healthy  controls  is  ongoing. 

Task  2  (Months  6-36)  Whitfield  Laboratory  to  perform  RNA-seq  analysis  for  differentially  expressed  mRNAs 
and  non-coding  RNAs. 

Raw  sequence  reads  have  been  analyzed  using  publicly  available  software  packages  that  have  been 
optimized  and  validated  by  us. 

Task  3  (Months  6-36)  Arron  group  to  perform  IMSA  and  determine  the  identity  of  microbiome  components. 

Dr.  Arron’ s  group  is  currently  performing  metagenomic  analysis  on  28  new  and  existing  skin  biopsy 
samples  (22  SSc  and  6  healthy  controls)  from  the  Whitfield  Laboratory. 

Task  4  (Months  1-24)  Arron  group  to  create  scaffolds  from  aligned  reads  for  each  microbiome  component  and 
develop  nested  PCR  followed  by  targeted  multiplexed  sequencing  assays  for  cost-effective  screening. 

We  developed  a  fast  and  efficient  nested  PCR  reaction  targeting  microbiome  components  specific  for 
fungal  species  identification.  We  then  evaluated  the  identity  and  number  of  fungal  reads  by  next 
generation  sequencing  (see  Task  5).  We  have  found  the  data  from  this  targeted  mycobiome  method  to  be 
lower  quality  than  our  metagenomic  RNA-seq  analyses  and  we  are  exploring  targeted  microbiome 
methods. 

Our  preliminary  findings  suggest  significant  bacterial  dysbiosis  in  affected  skin,  with  only  modest  changes 
in  fungal  abundance.  Validation  of  these  findings  will  therefore  be  run  using  targeted  1 6S  sequencing  to 
identify  changes  in  bacterial  composition.  DNA  has  been  collected  for  116  skin  samples  (SSc  and  healthy 
controls)  to  date,  with  an  additional  178  samples  awaiting  DNA  purification.  Targeted  sequencing  of  the 
V2-4-8  and  V3-6J-9  hypervariable  regions  of  16S  rRNA  from  these  samples  will  be  performed  using  the 
Ion  1 6S  Metagenomics  Kit  (ThermoFisher),  and  sequenced  on  the  Ion  Torrent. 
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The  original  goal  of  this  task  was  validation.  Therefore,  validation  will  also  be  performed  by  analyzing 
RNA-seq  data  generated  as  part  of  other  studies  (n  >  100).  This  approach  enables  direct  assessment  of 
all  microbiome  components  associated  with  affected  skin,  including  archaea,  bacteria,  viruses,  fungi,  and 
parasites,  as  well  as  comparisons  between  microbiome  composition  and  gene  expression  of  associated 
tissues  using  the  same  computational  methods. 

Task  5  (Months  1-12)  Whitfield  Laboratory  to  examine  a  larger  population  of  archived  skin  biopsy  RNA  to 
determine  the  prevalence  of  microbiome  components  across  the  SSc  population. 

See  above. 

Task  6  (Months  1-24)  Culture  microbiome  components  from  the  skin  of  SSc  patients.  Use  of  skin  biopsies  as  a 
method  for  fungal  culture  was  not  successful. 

Disappointingly,  swabbing  of  affected  skin  of  SSc  patients  has  yet  to  result  in  recovery  of  clinically  relevant 
fungi  or  other  organisms.  Swabbing  and  other  culture-based  methods  of  microbial  detection  will  be 
revisited  following  completion  of  our  RNA-seq  analyses,  enabling  targeted  isolation  of  organisms 
associated  with  SSc  lesional  skin. 

Milestone  2  Identify  the  inflammatory  infiltrates  in  SSc  skin  and  their  response  to  microbiome  components 

Task  1  (Months  1-6)  Whitfield  Laboratory  to  perform  computational  analysis/prediction  of  inflammatory  cell 
infiltrates  from  whole  genome  expression  data. 

We  used  single  sample  Gene  Set  Enrichment  Analysis  (ssGSEA)  to  identify  the  cellular  subsets  in  SSc 
skin  at  different  stages  of  disease. 

Task  2  (Months  6-24)  Perform  immunohistochemistry  to  validate  the  computational  predictions  of  task  1. 

We  have  optimized  markers  for  different  cell  types  in  SSc  skin.  We  have  used  CD163  for  macrophages, 
CD  1c  for  myeloid  dendritic  cells  (mDCs),  and  CD3  for  T  cells  in  a  separate  study.  Current  work  focuses  on 
the  identification  of  T  cell  subsets  (CD4  and  CD8),  and  B  cells  (CD19  and  CD20).  We  can  now  use  these 
markers  to  look  at  innate  and  adaptive  immune  cells  in  the  patients  of  this  study. 

Task  3  (Months  1-18)  Whitfield  Laboratory  to  develop  protocols  for  the  isolation  and  characterization  of 
immune  cells  from  skin  using  the  sclerodermatous  Graft-Versus  Host  Disease  (scIGVHD)  mouse  including 
detailed  characterization  of  cell  types. 

We  established  the  scIGVHD  model  in  the  laboratory.  We  have  performed  initial  cell  isolations  and 
phenotyping  of  these  samples. 

Task  4  (Months  6-18)  Identify  the  secreted  mediators  of  fibrosis  /  inflammation  being  produced  (Whitfield  / 
Pioli).  Once  cells  are  isolated,  we  will  screen  for  secreted  pro-fibrotic  mediators. 

We  have  analyzed  profibrotic  mediators  in  these  cells,  work  is  ongoing. 

Task  5  (Months  12-36)  Apply  protocols  to  characterize  the  inflammatory  infiltrate  in  the  skin  of  SSc  patients 
(Whitfield  /  Pioli).  After  cell  isolation  procedures  have  been  optimized  in  the  scIGVHD  mouse,  we  will  examine 
the  infiltrate  and  profibrotic  mediators  in  SSc  skin  biopsies. 

Protocols  for  isolating  these  cells  from  mouse  skin  have  been  optimized.  Work  is  underway  for  human 
skin. 

Milestone  3  Determine  if  SSc  patients  have  a  specific  immune  response  against  R.  glutinis  that  is  different 
from  healthy  controls  and  if  this  response  can  drive  fibrosis. 

Task  1  (Months  1-24)  Test  patient  sera  for  cross-reactivity  against  R.  glutinis  antigens  (Whitfield/Lafyatis). 

We  have  performed  western  blots  using  whole  cell  lysates  and  probed  with  sera  collected  from  both 
healthy  controls  and  SSc  patients. 

Task  2  (Months  1-24)  Identify  the  cross-reacting  proteins  by  mass  spectrometry  (Whitfield). 

Serum-immunoprecipitation  of  R.  glutinis  and  human  HeLa  cell  whole  cell  lysates  followed  by  mass 
spectrometry  was  performed  to  identify  immunoreactive  proteins  associated  with  R.  glutinis.  We  have 
written  a  manuscript  on  the  human  cross-reactivity.  We  are  having  difficulty  with  the  annotation  state  of 
the  R.  glutinis  genome  for  annotating  those  spectra. 
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Task  3  (Months  12-36)  Use  isolated  PBMCs  and  isolated  monocytes  to  examine  the  cytokines  secreted  and 
changes  in  gene  expression  when  cells  are  exposed  to  R.  glutinis  or  other  putative  micro  /  mycobiome  triggers 
(Whitfield/Pioli). 

Work  underway. 

Task  4  (Months  12-24)  Determine  if  chronic  exposure  to  R.  glutinis  or  other  micro  /  mycobiome  components 
stimulate  a  fibrotic  response  in  a  mouse  model  of  SSc.  (Whitfield). 

Work  underway. 

PRELIMINARY  RESULTS  BY  MILESTONE 


Milestone  1 :  Determine  the  identity  and  distribution  of  microbiome  components  across  SSc  skin 

Task  1:  RNA-Seq  analysis  of  SSc  skin.  Including  technical  replicates,  RNA-seq  has  been  run  on  18  SSc 
patient  skin  biopsies  to  date.  Recruitment  of  additional  SSc  patients  and  healthy  controls  is  ongoing.  mRNA 
from  22  SSc  patients  and  6  healthy  controls  have  been  sequenced,  yielding  30-237  million  paired-end  reads 
per  sample.  Reads  were  then  aligned  to  the  human  genome  (hg19  assembly).  Approximately  70%-80%  of 
reads  were  uniquely  mapped,  which  is  in  line  with  expectations  (Table  1). 


Table  1.  Statistics  of  alignment 


Sample  ID 

Disease 

Type 

Total  Reads 

Uniquely 

Mapped 

Mapped 

length 

%  multi- 
mapped 

% 

Unmapped 

MK01-FA 

SSc 

79642964 

0.6 

148 

0.37 

0.03 

MK01-B 

SSc 

87465540 

0.45 

148 

0.53 

0.02 

AM02-FA 

SSc 

82459754 

0.7 

148.1 

0.28 

0.02 

AM02-B 

SSc 

89078313 

0.64 

148.1 

0.34 

0.02 

KB03-FA 

SSc 

110881994 

0.8 

148.1 

0.17 

0.03 

KB03-B 

SSc 

80684936 

0.76 

148 

0.14 

0.1 

JP04-FA 

SSc 

86911206 

0.73 

148 

0.15 

0.12 

JP04-B 

SSc 

87609067 

0.54 

148 

0.36 

0.1 

KB05-FA 

SSc 

78252845 

0.72 

148 

0.16 

0.12 

KB05-B 

SSc 

105842068 

0.67 

147.7 

0.27 

0.06 

KL06-FA 

SSc 

103150299 

0.81 

148 

0.15 

0.04 

KL06-B 

SSc 

75717290 

0.64 

148.2 

0.34 

0.02 

SH07-FA 

SSc 

85776797 

0.67 

145.7 

0.13 

0.2 

SH07-B 

SSc 

113066228 

0.75 

148 

0.2 

0.04 

N01_Base 

SSc 

82992452 

0.67 

148 

0.31 

0.03 

N05_Base 

SSc 

77151907 

0.74 

148.1 

0.23 

0.02 

N07-Base 

SSc 

87025838 

0.8 

148.1 

0.18 

0.02 

N09_Base 

SSc 

86468461 

0.73 

148.1 

0.25 

0.02 

N10_Base 

SSc 

100362240 

0.62 

147.7 

0.31 

0.07 

Nll_Base 

SSc 

105056505 

0.47 

147.9 

0.47 

0.05 

N15_Base 

SSc 

81209030 

0.81 

147.8 

0.12 

0.07 

N18_Base 

SSc 

72275968 

0.8 

147.6 

0.13 

0.07 

N13-2 

Control 

80255529 

0.64 

148.1 

0.34 

0.02 

N15-3 

Control 

92668648 

0.67 

147.4 

0.31 

0.02 

N15-5 

Control 

84968648 

0.59 

147.9 

0.38 

0.04 

N15-15 

Control 

92222389 

0.32 

148 

0.61 

0.06 

N 15-21 

Control 

103634414 

0.32 

147.6 

0.58 

0.06 

N15-24 

Control 

75379182 

0.78 

148 

0.14 

0.08 
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Task  2  RNA-seq  analysis  for  differentially  expressed  mRNAs.  In  order  to  identify  differentially  expressed 
mRNA,  we  used  RSEM  software  to  estimate  the  abundance  of  each  mRNA  transcript.  Each  sample  was 
normalized  using  quantile  normalization.  Batch  biases  generated  by  the  inclusion  of  previously  sequenced 
samples  from  a  separate  study  (N_Base  samples)  was  performed  with  ComBat  (Figure  1). 

PC.1  and  PC. 2  for  SSc  miRNA-seq  data  PC.1  and  PC.2  for  SSc  miRNA-seq  data 


S  o 


-50  0  50  -60  -40  -20  0  20  40  60 

PCI  PCI 

As  an  initial  analysis  of  these  data,  we  chose  to  examine  the  consensus  genes  from  Mahoney  et  al.  [4].  These 
are  genes  that  were  consistently  and  reproducibly  associated  with  individual  SSc  intrinsic  gene  expression 
subsets  across  three  independent  patient  cohorts.  Expression  of  these  genes  in  our  RNA-seq  data  reveals 
increased  expression  in  the  inflammatory  and  fibroproliferative  subsets  of  patients  (Figure  2).  Expression  of 
these  genes  is  shown  both  before  and  after  batch  correction.  Intermixing  of  samples  is  clearly  evident  after 
ComBat  correction,  indicating  that  batch  correction  was  successful. 


Figure  1. 
PCA  plot  of 
measurem 
ents.  (Left) 
no  ComBat 
correction; 
(Right)  with 
ComBat 
correction. 


JP04 
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N10 

AM02 

N9 

N15 
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KB03  KB05 

KL06 

Nil 
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N1  N18 
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MK01 

Figure  2.  Heatmapof 
Mahoney_267modules.  (Left)  no 
ComBat  correction;  (Right)  with 
ComBat  correction.  The  attached 
heatmap  only  shows  the  samples 
from  year  1  as  the  current  set  of 
samples  are  undergoing  analysis. 
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Task  3:  IMSA  analysis  to  identity  microbiome  components.  In  order  to  map  the  microbiome  components 
present  in  these  skin  biopsies,  IMSA  has  been  performed  on  RNA-seq  data  from  28  skin  biopsies  (22  SSc  and 
6  healthy  controls).  We  performed  quality  filtering  and  human  sequence  filtering  using  human  genome  (hg19). 
Over  99%  of  the  total  readset  was  derived  from  human  or  nonhuman  primates  in  both  SSc  and  control 
samples.  IMSA  was  used  to  map  reads  to  the  NCBI  non-redundant  nucleotide  (nt)  database  and  generate 
taxonomy  reports. 

In  preliminary  data,  we  found  that  only  inflammatory  samples  have  high  Rhodotorula  glutinis  target  read  counts 
(Figure  3),  consistent  with  the  preliminary  data  we  presented  in  our  initial  grant  proposal.  Therefore,  this 
preliminary  analysis  validates  those  original  data. 
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Figure  3.  IMSA  analysis  of  RNA-seq  data  from  SSc  skin  biopsies.  SSc  skin  biopsies  were  divided  by  intrinsic  gene  expression  subset, 
as  previously  described  [3,  8].  Each  biopsy  was  analyzed  for  R.  glutinis  sequences  and  IMSA  score  plotted  for  each  subset. 


We  used  a  process  called  rarefaction  to  account  for  the  variable  read  depth  of  different  samples.  We  used 
using  Quantitative  Insights  Into  Microbial  Ecology  (QIIME)  to  perform  rarefaction  of  outputs,  a  process  by 
which  taxa  are  randomly  sampled  without  replacement;  this  process  is  necessary  to  ensure  even  sampling 
depth  across  patients.  Alpha  and  beta  diversity  measures  are  calculated  from  these  data. 

Comparisons  of  SSc  and  controls  revealed  statistically  significant  differences  in  abundance  between  five  of  the 
top  seven  genera,  with  observed  increases  in  SSc  skin  in  the  mRNA  of  major  bacterial  commensals  such  as 
Propionibacterium,  Staphylococcus,  and  Corynebacteriurrr,  decreases  were  observed  in  Citrobacter  and  Vibrio, 
relative  to  controls  (p  =  0.019,  p  =  0.004,  and  p  =  0.002,  respectively;  Figure  4A).  Rhodotorula  levels  overall 
were  not  significantly  different  between  SSc  and  controls  in  this  cohort,  which  we  believe  is  likely  the  result  of 
differences  in  patient  populations,  with  this  second  patient  cohort  consisting  primarily  of  later  disease  duration. 
Biopsies  from  additional  8  untreated,  early-stage  SSc  patients  are  being  processed  further  validate  the 
presence  of  R.  glutinis. 
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Figure  4.  Differences  in  microbiome 
composition  between  SSc  patients  and 
controls.  Significant  differences  in  genus- 
level  abundance  and  alpha  diversity  were 
observed  between  groups.  A)  Abundance 
of  major  genera  in  SSc  (blue)  vs  controls 
(green);  red  boxes  indicate  organisms 
significantly  more  abundant  in  SSc.  B) 
Alpha  diversity  analysis  (Chaol)  of  SSc 
patients  by  disease  duration.  Samples 
were  divided  into  5  groups:  controls,  <  2 
years,  2-5  years,  5-10  years,  and  >  10  years. 


Dysbiosis  changes  with  disease  duration.  Alpha  diversity  provides  a  measure  of  inter-sample  diversity. 
Analysis  of  our  SSc  samples  reveals  significant  changes  in  microbiome  composition  associated  with  disease 
duration  (p  =  0.01 1  by  Chaol).  Comparisons  were  performed  using  4  disease  duration  groups,  defined  as  <  2 
years,  2-5  years,  5-10  years,  and  >  10  years,  with  or  without  healthy  controls.  Early  stage  patients  (disease 
duration  <  2  years  after  first  non-Raynaud’s  symptoms)  exhibit  less  microbiome  diversity,  followed  by 
increasing  diversity  thereafter  (Figure  4B).  The  most  significant  differences  were  observed  between  patients 
<6yr  disease  duration  compared  with  those  with  >6yr  (p  =  0.001). 

Linking  microbiome  dysbiosis  to  host  gene  expression  changes.  We  examined  our  data  for  gene 
expression  changes  that  were  linked  to  differences  in  microbiome  composition  in  each  patient  sample.  Gene 
expression  for  each  of  the  28  skin  biopsies  was  calculated  from  the  RNA-seq  data  for  the  intrinsic  genes  from 
Johnson,  et  al.  [1]  (Figure  5).  Hierarchical  clustering  of  these  data  showed  clear  inflammatory  (purple 

branches)  and  non-inflammatory  (green  branches)  patients  (Figure  5A). 
The  inflammatory  group  shows  high  expression  of  genes  associated  with 
immune  activation  (Figure  5C).  In  these  data,  this  split  is  significantly 
associated  with  disease  duration,  with  early  stage  patients  (red  blocks; 
disease  duration  <5  years)  clustering  within  the  inflammatory  group  (Figure 
5B).  With  one  exception,  all  late  stage  patients  (blue  blocks;  disease 
duration  >5  years)  and  healthy  controls  (green  blocks)  clustered  on  the 
non-inflammatory  branch  (Figure  5B).  The  strong  fatty  acid  and  lipid 
metabolism  signature  evident  in  healthy  controls,  and  a  small  number  of 
inflammatory  patients  (Figure  5D),  suggest  multiple  intrinsic  subsets. 
Consistent  with  our  data  above,  patients  in  the  inflammatory  group  have 
low  microbiome  diversity  (i.e.  significant  dysbiosis)  while  samples  in  the 
non-inflammatory  group  have  increased  microbiome  diversity.  Therefore, 
there  is  a  correlation  between  presence  of  the  inflammatory  gene 
expression  signature  and  decreased  microbiome  diversity. 


Figure  5.  Gene  expression  may  influence  microbiome  composition  in  skin.  RNA-seq  analysis  of 
28  skin  biopsies  (22  SSc  and  6  controls)  were  median  centered  by  gene  and  clustered  using  the 
intrinsic  gene  list  described  in  Johnson  et  al  [1]  collapsed  by  unique  gene,  resulting  in  1826 
genes.  A)  Samples  clustered  into  inflammatory  (purple)  and  non-inflammatory  groups  based 
primarily  on  B)  disease  duration  (red  =  <  5  years;  blue  =  >  5  years;  green  =  healthy  controls).  C) 
Inflammatory  and  D)  lipid  signaling  gene  expression  modules  are  the  strongest  signals  affecting 
clustering. 
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Based  on  these  data,  we  find  substantial  differences  between  microbial  populations  of  SSc  skin  with  increasing 
disease  duration.  Within  SSc  patients,  alpha  diversity  was  significantly  lower  in  the  inflammatory  group  relative 
to  non-inflammatory  patients  ( p  =  0.004  for  Chaol),  generally  characterized  by  a  small  number  of  dominant 
genera,  such  as  Escherichia  and  Propionibacterium.  These  data  suggest  that  changes  in  the  cutaneous 
microbiome  of  SSc  patients  may  directly  contribute  to  immune  activation  in  these  patients. 

Tasks  4  and  5.  Develop  a  nested  PCR-based  assay  followed  by  targeted  multiplexed  sequencing  as  a 
cost-effective  method  for  screening  archived  skin  biopsy  RNA  to  determine  the  prevalence  of 
microbiome  components  across  the  SSc  population.  Improvements  in  our  sample-processing  pipeline 
now  allow  for  simultaneous  extraction  of  DNA,  RNA,  and  miRNA  from  all  patient  biopsies.  DNA  is  being  used 
as  a  template  for  targeted  sequencing  of  the  intergenic  transcribed  spacer  regions  (ITS),  a  region  widely 
regarded  as  the  gold  standard  for  fungal  species  identification.  To  date,  targeted  ITS  sequencing  libraries  have 
been  analyzed  from  48  archived  samples  (39  SSc  and  9  controls),  which  includes  both  paired  lesional  and 
non-lesional  skin  as  well  as  multiple  time  points  from  a  single  patient  (Figure  5).  Sequencing  outputs  are  being 
analyzed  by  IMSA  to  identify  differences  in  microbial  diversity  and  species  abundance  between  patients  and 
controls,  between  lesional  and  non-lesion  skin,  as  well  as  how  these  populations  change  over  time. 

Our  preliminary  findings  from  our  RNA-seq  data  suggest  significant  bacterial  dysbiosis  in  affected  skin,  with 
only  modest  changes  in  fungal  abundance.  Validation  of  these  findings  will  therefore  be  run  using  targeted 
16S  sequencing  to  identify  changes  in  bacterial  composition.  DNA  has  been  collected  for  116  skin  samples 
(SSc  and  healthy  controls)  to  date,  with  an  additional  178  samples  awaiting  DNA  purification.  Targeted 
sequencing  of  the  V2-4-8  and  V3-6,7-9  hypervariable  regions  of  16S  rRNA  from  these  samples  will  be 
performed  using  the  Ion  16S  Metagenomics  Kit  (ThermoFisher),  and  sequenced  on  the  Ion  Torrent. 

Figure  6.  Targeted  ITS  sequencing  of  normal  and  SSc  skin  biopsies.  Below  is  a  preliminary  analysis  of  targeted  fungal  ITS 
sequencing  which  shows  a  subset  of  patients  have  increased  R.  glutinis  sequences  (red).  The  most  prominent  fungal 
species  detected  on  skin  were  Malassezia  spp.  (blue),  the  most  common  genus  of  skin  commensal  fungi. 

Proportional  Distribution  of  Fungi  On  Normal  and  SSc  Skin 


Task  6:  Culture  microbiome  components  from  the  skin  of  SSc  patients.  As  mentioned  above,  swabbing 
of  affected  skin  of  SSc  patients  has  unfortunately  yet  to  result  in  recovery  of  clinically  relevant  fungi  or  other 
organisms.  Swabbing  and  other  culture-based  methods  of  microbial  detection  will  be  revisited  following 
completion  of  our  RNA-seq  analyses,  enabling  targeted  isolation  of  organisms  associated  with  SSc  lesional 
skin.  This  may  be  due  to  the  use  of  antiseptics  prior  to  biopsy  collection  as  a  means  of  preventing  infection  of 
the  biopsy  site. 

Milestone  2:  Identify  the  inflammatory  infiltrates  in  SSc  skin  and  their  response  to  microbiome  components 

Task  1:  Computational  prediction  of  inflammatory  cell  infiltrates  from  genomic  expression  data.  We 

have  used  single  sample  Gene  Set  Enrichment  Analysis  (ssGSEA)  to  identify  the  cellular  subsets  in  SSc  skin 
at  different  stages  of  disease.  We  first  benchmarked  the  ssGSEA  method  in  my  laboratory  using  publicly 
available  gene  expression  data  from  pools  of  cell  lines  that  had  a  known  composition  (data  not  shown).  These 
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data  demonstrated  that  ssGSEA  accurately  predicted  cell  type  enrichment.  We  then  analyzed  a  set  of  patients 
for  whom  we  had  whole  genome  expression  data  and  that  had  strong  expression  of  the  inflammatory 
signature.  We  find  the  inflammatory  signature  is  most  strongly  correlated  with  gene  expression  signatures  from 
activated  Dendritic  Cells  (DCs)  and  macrophages  (M0s)  (Figure  6).  These  methods  are  being  applied  in 
conjunction  with  the  samples  being  analyzed  in  milestone  1 . 

Figure  7.  Correlation  of  cell  type  signatures  with  a  patient's  inflammatory  signature  normalized  enrichment  score 
(NES).  The  inflammatory  signature  in  SSc  skin  is  most  highly  correlated  with  activated  DCs  and  M0s. 
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Task  2:  Perform  immunohistochemistry  to  validate  the  computational  predictions  of  task  1  above.  We 

have  optimized  markers  for  different  cell  types  in  SSc  skin. 

We  have  optimized  markers  for  different  cell  types  in  SSc  skin.  We  have  used  CD163  for  macrophages,  CDIc 
for  myeloid  dendritic  cells  (mDCs)  and  CD3  for  T  cells  in  a  separate  study.  We  can  now  use  these  markers  to 
look  at  innate  and  adaptive  immune  cells  in  the  patients  of  this  study.  We  have  optimized  these  stains  in  a  set 
of  SSc  samples  and  will  now  be  performing  these  stains  in  samples  for  this  study.  T  cell  subsets  will  be 
identified  using  antibodies  against  CD4  and  CD8,  and  CD19  and  CD20  will  be  used  to  determine  B  cell 
localization. 

Task  3:  Develop  protocols  for  the  isolation  and  characterization  of  immune  cells  from  skin  using  the 
sclerodermatous  Graft-Versus  Host  Disease  (scIGVHD)  mouse  including  detailed  characterization  of 
cell  types. 

We  established  the  scIGVHD  model  in  the  laboratory  and  can  recapitulate  both  skin  thickening  (Figure  8)  and 
the  aberrant  gene  expression  profiles  observed  in  our  prior  studies  (Figure  9).  We  have  performed  initial  cell 
isolations  and  phenotyping  of  these  samples  (Figure  10). 


A.  Allogeneic  Transfer  (scIGVHD)  B.  Syngeneic  Transfer  (control) 


Figure  8.  Immunohistochemistry  was  performed  to  show  increased  fibrosis  at  2  weeks  after  disease  intitiation.  As  expected  we 
observe  skin  thickening  in  the  scIGVHD  mouse  that  is  not  observed  in  controls. 


Gene  expression  analyses  were  performed  on  skin  biopsies  from  the  scIGVHD  mouse  and  compared  to  our 
prior  study  of  this  model  [9].  We  find  gene  expression  changes  were  produced  and  consistent  with  those 
observed  when  the  model  was  generated  at  Harvard  (Figure  9).  Therefore,  we  can  clearly  reproduce  this 
model  faithfully,  including  the  molecular  SSc  phenotype. 
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Figure  9:  Gene  expression  analysis  of  the  scIGVHD  mouse.  We  performed  gene  expression 
microarray  analyses  of  the  skin  of  the  scIGVHD  mouse  generated  in  Dr.  Whitfield's  lab  for  this 
study  with  data  from  the  scIGVHD  taken  from  Greenblatt  et  al.  2012  [9].  We  find  that  samples 
from  mice  generated  in  this  project  (highlighted  boxes)  faithfully  recapitulate  the  aberrant 
gene  expression  observed  in  our  original  study.  These  gene  expression  data  are  consistent 
with  what  we  observe  in  the  inflammatory  subset  of  SSc. 

Skin  tissue  was  minced,  digested  with  collagenase  D  and  DNase  I,  and  filtered 
through  70-  and  40-micron  mesh  to  facilitate  cell  dispersion.  Single  cell 
suspensions  were  stained  with  antibodies  directed  against  the  pan-leukocyte 
marker  CD45,  myeloid  cell  markers  CDIIb  and  CDIIc,  and  CD115,  CD206, 
and  murine  MHC-Class  II  (IA/IE)  for  flow  cytometric  analysis  (Figure  10). 
Gating  of  positively  stained  cells  was  determined  using  fluorescence-minus- 
one  (FMO)  controls.  CD45  positive  live  cells  were  gated  and  surface 
expression  of  CDIIc,  which  is  a  murine  dendritic  cell  marker,  and  CDIIb, 
which  is  highly  expressed  on  mouse  macrophages,  was  analyzed  on  the 
CD45+  cell  population  (Figures  10A  and  10B).  Consistent  with  previous  reports 
[9,  10],  there  is  an  increase  in  skin  macrophages  (CD1 1b+CD1 1c  )  in  scIGVHD  mice  compared  with  syngeneic 
transplant  controls.  As  demonstrated  in  Figure  10C,  the  macrophage  cell  population  is  characterized  by 
significantly  increased  expression  of  the  CSF-1R  CD115  and  murine  MHC  Class  II  (IA/IE).  Surface  levels  of 
the  mannose  receptor  CD206  were  also  elevated,  although  they  did  not  reach  statistical  significance.  These 
findings  are  consistent  with  results  obtained  by  Greenblatt  et  al.,  indicating  that  we  have  established  this  model 
for  our  future  analysis.  Furthermore,  these  results  suggest  that  the  activation  profile  of  SSc  macrophages  is 
unlikely  to  conform  to  a  uniformly  pro-  or  anti-inflammatory  polarization  state,  as  CD115  and  CD206  are 
typically  expressed  by  alternatively  activated  macrophages  and  enhanced  IA/IE  surface  levels  are 
characteristic  of  pro-inflammatory  macrophages. 
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Figure  10.  Characterization  of  the  Myeloid 
Cell  Population  in  skin  of  the  2  week  old 
scIGVHD  mouse.  Flow  cytometric  analysis  of 
CD45+  cells  derived  from  back  skin  of  BALB/c 
Rog2'/  hosts  that  received  either  syngeneic 
BALB/c  or  allogeneic  B10.D2  splenocytes  two 
weeks  prior  to  cell  harvest  (n=3  per  group):  A. 
Gating  strategy  for  selection  of  CDIIb  vs. 
CDIIc  positive  cells.  B.  Percentage  of 
CDllb/CDllc  cell  populations  in  syngeneic 
(control)  vs.  allogeneic  (experimental) 
recipients  measured  in  panel  A.  and  C. 
Percentages  and  mean  fluorescence  intensity 
(MFI)  of  three  characteristic  macrophage 
markers  (CD115,  CD206,  and  IA/IE)  on  gated 
macrophage  population  (CDllbVCDllc)  in 
syngeneic  (black  bars)  vs.  allogeneic  (red  bars) 
recipients. 


Task  4:  Identify  the  secreted  mediators  of  fibrosis  /  inflammation  being  produced  (Whitfield  /  Pioli). 
Once  cells  are  isolated,  we  will  screen  for  secreted  pro-fibrotic  mediators. 

We  have  analyzed  profibrotic  mediators  in  these  cells,  work  is  ongoing. 

Task  5:  Apply  protocols  to  characterize  the  inflammatory  infiltrate  in  the  skin  of  SSc  patients. 
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Protocols  for  isolating  these  cells  from  mouse  skin  have  been  optimized.  Work  is  underway  for  human 
skin. 


Milestone  3:  Determine  if  SSc  patients  have  a  specific  immune  response  against  R.  glutinis  that  is  different 
from  healthy  controls  and  if  this  response  can  drive  fibrosis. 

Task  1:  Test  patient  sera  for  cross-reactivity  against  R.  glutinis  antigens.  In  these  experiments,  we  set 
out  to  test  the  hypothesis  that  autoantibody  reactivity  observed  in  SSc  could  recognize  the  same  proteins  in 
fungi,  indicating  that  autoantibodies  may  have  originated  in  response  to  fungal  infection.  Western  blots  were 
performed  using  R.  glutinis,  Malassezia  furfur,  Saccharomyces  cerevisiae,  and  HeLa  whole  cell  lysates  (to  test 
cross-reactivity  with  humans),  and  probed  with  sera  collected  from  both  healthy  controls  and  SSc  patients 
representing  the  three  major  autoantibody  groups  (Controls,  CENP,  TOPI,  and  RNAP3).  Clear  differences  in 
cross-reactivity  were  evident  between  patient  subsets.  SSc  patients  showed  a  pattern  of  cross  reactivity 
against  R.  glutinis  lysates  that  was  distinct  from  that  observed  in  healthy  controls.  Among  clinical  autoantibody 
groups,  a  band  consistent  with  the  presence  of  TOPI  was  seen  in  3  of  4  TOPI  patients  against  R.  glutinis  and 
HeLa  cells  (Figure  7)  this  band  was  not  observed  in  either  M.  furfur  or  S.  cerevisiae,  suggesting  the  possibility 
of  cross-reactivity  between  R.  glutinis  and  human  TOPI  (Figure  7).  Specific  cross  reactivity  was  also  observed 
in  CENP  and  RNAP3  patients;  the  identity  of  these  proteins  is  being  investigated. 

Figure  11.  Western  blots  using  SSc  and  control  Sera 


HeLa  M.  furfur 


| 

1  1 

141 

1 

Control 

CENP 

TOPI 

RNAP3 

Control 

CENP 

TOPI 

RNAP3 

R.  glutinis  S.  cerevisiae 


Task  2.  Identify  the  cross-reacting  proteins  by  mass  spectrometry.  Serum-immunoprecipitation  of  R. 
glutinis  and  HeLa  cell  whole  cell  lysates  followed  by  mass  spectrometry  was  performed  to  identify 
immunoreactive  proteins  associated  with  R.  glutinis.  As  part  of  this  investigation,  we  performed  a  high- 
throughput  analysis  of  all  autoantibodies  present  in  SSc  sera,  revealing  novel  targets  and  associated 
processes  associated  with  SSc,  which  is  now  in  press  (Johnson,  et  al.  Arthritis  Research  and  Therapy,  2016). 

Serum-immunoprecipitation  of  R.  glutinis  whole  cell  lysates  followed  by  mass  spectrometry  revealed 
considerable  reactivity  in  both  SSc  patients  and  healthy  controls;  however,  identification  of  target  peptides  was 
not  possible  due  to  the  absence  of  a  sufficiently  well-annotated  R.  glutinis  proteome.  Cross  reactivity  of 
autoantibodies  against  human  and  fungal  proteins  has  been  confirmed  in  S.  cerevisiae,  with  autoantibodies 
against  human  TOPI  strongly  cross-reactive  to  the  equivalent  protein  in  S.  cerevisiae. 

Task  3:  Use  isolated  PBMCs  and  isolated  monocytes  to  examine  the  cytokines  secreted  and  changes 
in  gene  expression  when  cells  are  exposed  to  R.  glutinis  or  other  putative  micro  /  mycobiome  triggers. 

Work  is  underway.  We  have  shown  that  PBMCs  and  macrophages  from  SSc  patients  (monocytes 
isolated  from  peripheral  blood  of  SSc  patients  that  are  differentiated  in  autologous  sera)  are  activated 
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under  basal  conditions,  as  these  cells  secrete  pro-and  anti-inflammatory  mediators  in  the  absence  of 
exogenous  activation.  Therefore,  additional  stimulation  with  LPS  does  not  appear  to  further  activate  the 
cells.  These  activated  cells  produce  a  wide  range  of  pro-fibrotic  molecules  that  have  been  implicated  in 
SSc,  including  IL-6  and  TGFbeta.  We  believe  these  are  the  major  drivers  of  fibrosis  in  SSc. 

Task  4:  Determine  if  chronic  exposure  to  R.  glutinis  or  other  micro  /  mycobiome  components  stimulate 
a  fibrotic  response  in  a  mouse  model  of  SSc. 

Work  underway.  The  scIGVHD  mouse  has  been  established  and  this  task  should  be  completed  in  the 
coming  year. 

CONCLUSION: 

We  have  made  significant  progress  on  all  milestones  for  the  first  two  years  of  our  grant.  We  have  sequenced 
samples,  and  are  actively  analyzing  gene  expression  changes  and  performing  metagenomic  analyses  for 
micro-  and  mycobiome  composition.  Analyses  of  the  data  have  revealed  widespread  dysbiosis  in  SSc  skin, 
with  additional  sequencing  ongoing  to  both  confirm  and  expand  on  these  results.  Computational  analyses 
have  identified  activated  DCs  and  M0s  as  the  key  cell  types  driving  the  inflammatory  signature,  a  phenotype 
consistent  with  the  presence  of  a  mycobiome  trigger;  these  results  are  now  being  confirmed  experimentally. 
We  have  generated  the  scIGVHD  mouse  model,  optimized  our  cell  isolation  procedures  and  methods  for 
isolating  macrophages  from  these  mice.  Immuno-phenotyping  and  response  to  microbiome  components  is  in 
process.  We  have  analyzed  the  cross-reactivity  of  autoantibodies  with  human  and  fungal  components.  A 
paper  reporting  the  autoantibody  cross-reactivity  to  human  proteins  in  HeLa  cells  has  been  published  [111. 

KEY  RESEARCH  ACCOMPLISHMENTS  Summary 

•  RNA-seq  followed  by  metagenomic  analyses  for  microbiome  components  reveals  dysbiosis  as  a 
common  feature  of  SSc  skin,  which  increases  with  disease  duration. 

•  Our  work  has  shown  that  the  innate  immune  system  (macrophages  and  DCs)  are  likely  drivers  of  SSc 
and  that  these  cells  respond  to  microbiome  components. 

•  We  have  successfully  established  the  scIGVHD  model  in  the  laboratory  and  are  examining  the  immune 
drivers  and  their  response  to  microbiome  components. 

•  Cross  reactivity  with  microbiome  component  is  observed  and  there  are  data  to  suggest  basal  activation 
of  immune  cells  in  SSc. 

The  next  reporting  period: 

September  2016-September  2017 

4.  IMPACT 

What  was  the  impact  on  the  development  of  the  principal  discipline(s)  of  the  project? 

The  major  impact  of  this  project  is  that  we  are  demonstrating  a  novel  paradigm  for  the  initiation  of  SSc.  This 
has  the  potential  to  dramatically  change  the  way  we  think  about  SSc  and  the  role  of  the  innate  immune  system 
in  driving  disease. 

What  was  the  impact  on  other  disciplines? 

This  study  impacts  areas  of  genomics,  metagenomics,  microbiology,  innate  immunity,  and  autoimmunity.  The 
methods  we  demonstrate  and  develop  here  will  affect  all  of  these  fields.  In  particular,  this  study  begins  to 
develop  methods  for  both  systems  biology  and  metagenomic  sequencing  analyses  that  can  be  used  in  other 
rare  diseases. 

What  was  the  impact  on  technology  transfer? 

Technical  demands  associated  with  this  project  necessitated  the  development  of  a  novel  method  to  isolate 
DNA,  RNA,  and  miRNAs  from  a  single  skin  biopsy.  This  method  has  been  submitted  as  a  disclosure  to  our 
technology  transfer  office  (TTO). 

What  was  the  impact  on  society  beyond  science  and  technology? 
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Scleroderma  is  an  incurable  disease  that  often  has  a  very  poor  prognosis.  If  our  metagenomic  results  are 
confirmed,  this  will  provide  not  only  a  better  understanding  of  the  molecular  processes  driving  disease 
pathogenesis,  but  also  identify  alternative  strategies,  such  as  anti-fungal  treatment,  as  a  possible  treatment  for 
SSc. 

5.  CHANGES/PROBLEMS 

None  to  report. 

6.  PRODUCTS: 

None  at  this  time. 

Oral  Presentations:  (Chronological  Order) 

Presentations  for  Michael  L.  Whitfield,  PhD 

6/16  “ Systems  biology  and  bioinformatics  approaches  to  evaluating  disease  mechanisms  and 

therapeutic  trials  in  autoimmunity  and  fibrosis,,  University  of  North  Carolina  at  Chapel  Hill, 
Chapel  Hill,  NC 

6/16  “ Burroughs  Wellcome  Training  Program:  Big  Data  in  the  Life  Sciences ”  Burroughs  Wellcome 

Fund,  Durham  NC 

5/16  “ Systems  biology  and  bioinformatics  approaches  to  evaluating  disease  mechanisms  and 

therapeutic  trials  in  autoimmunity  and  fibrosis ”  University  of  Michigan,  Rheumatology  Grand 
Rounds,  Ann  Arbor  MI 

4/16  “The  genome  and  scleroderma ’s  social  network'  Scleroderma  Foundation  New  England,  Patient 

Education  Seminar,  Boston  MA 

3/16  “Multi-organ  systems  biology  reveals  a  common  immune-fibrotic  axis  in  systemic  sclerosis, 

pulmonary  fibrosis  and  pulmonary  arterial  hypertension”  Scleroderma  Research  Foundation 
Annual  Workshop,  San  Francisco,  CA 

2/16  “ Systems  biology  and  bioinformatics  approaches  to  understand  complex  autoimmune  diseases ” 

University  College  London,  London  UK 

2/16  “ Integrative ,  multi-organ  systems  biology  of  systemic  sclerosis  reveals  a  macrophage  signature 

associated  with  disease  severity  in  multiple  end-target  tissues ”  Scleroderma  World  Congress, 
Lisbon  Portugal 

2/16  “ Systems  biology  and  bioinformatics  approaches  to  understand  complex  autoimmune  diseases ” 

Pittsburgh  School  of  Medicine,  Pittsburgh,  PA 

2/16  “ Advances  in  understanding  pathogenesis  and  treatment  in  systemic  sclerosis ”  Rheumatology 

Grand  Rounds,  Dartmouth-Hitchcock  Medical  Center. 

12/15  “fi/'g  Data  in  the  Life  Sciences’’’  North  Carolina  Central  University,  Durham,  NC 

12/15  “fi/'g  Data  in  the  Life  Sciences ”  North  Carolina  State  University,  Raleigh,  NC 

12/15  “fi/'g  Data  in  the  Life  Sciences ”  University  of  North  Carolina,  Chapel  Hill,  NC 

11/15  “ Multi-tissue  genomic  networks  and  systems  biology  in  systemic  sclerosis ”.  Scleroderma 

Foundation  Workshop,  ACR  Annual  meeting,  San  Francisco  CA 
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8/15  “Systems  Biology  in  Systemic  Sclerosis.  ”  Session  Chair  and  topic  introduction.  Scleroderma 

Basic  Science  Workshop,  Cambridge  UK. 

6/15  “ Defining  overlapping  pathology  between  SSc  patients  and  commonly  used 

mouse  models  of  disease ”  Actelion,  Basel  Switzerland.  ( cancelled  due  to  illness) 

6/15  “ Genomic  and  Proteomic  Quantification  of  the  Heterogeneity  of  SSc:  Implications  for 

Pathogenesis  and  Treatment ”.  EULAR.  Rome,  Italy.  ( cancelled  due  to  illness ) 

6/15  “ Genomics ,  Bioinformatics  and  Systems  Biology  for  Precision  Medicine  in  Systemic  Sclerosis ”. 

NIH  CORT  (P50)  Advisory  Committee  meeting.  Boston  University  Medical  Center,  Boston  MA 

4/15  “ Genomics ,  Bioinformatics  and  Systems  Biology  for  Precision  Medicine  in  Systemic  Sclerosis” . 

SScores  (NIH  P30)  Advisory  Committee  meeting.  Boston  University  Medical  Center,  Boston 
MA. 

3/15  “ A  macrophage-associated  inflammatory  signature  is  found  in  all  SSc  tissues  and  associated 

with  more  severe  disease ”  Scleroderma  Research  Foundation  Workshop  on  Scleroderma,  San 
Francisco,  CA 

3/15  “ Molecular  stratification  and  drug  response  for  SSc  clinical  trials ”  Pfizer,  Cambridge,  MA. 

2/15  “ Enabling  Precision  Medicine  in  SSc  Clinical  Trials ”  Discussion  leader  and  presenter,  NIAMS 

roundtable  discussion  on  Scleroderma:  Advancing  Potential  Drugs  to  Patient  Care 

2/15  “ Linking  autoimmune  systemic  sclerosis  and  cancer:  disease  stratification,  co-expression 

networks  and  genetic  polymorphisms ”  Cancer  Mechanisms  Program,  Norris  Cotton  Cancer 
Center. 

1/14  “ Mechanisms  of  Systemic  Sclerosis  (Scleroderma)  pathogenesis  by  systems  level  genomic 

analyses ”  Genomic  Medicine  Grand  Rounds,  Geisel  School  of  Medicine. 

12/14  “ Untangling  molecular  changes  in  SSc  clinical  trials:  Gene  expression  subsets,  response 

signatures  and  pathway  changes ”  ASSET  Investigator  Meeting.  University  of  Michigan,  MI 

11/14  “ Identification  of  the  Microbiome  As  a  Potential  Trigger  of  Systemic  Sclerosis  By  Metagenomic 

RNA-Sequencing  of  Skin  Biopsies ”  ACR  Basic  Research  Conference  Boston,  MA. 

Dr.  Sarah  Arron  reports  no  presentations  on  this  topic  in  the  past  year. 

Abstracts  and  Presentations:  (Chronological  Order) 

1 .  Michael  E.  Johnson,  Zhenghui  Li,  Michelle  T.  Dimon,  Tammara  A.  Wood,  Robert  Lafyatis,  Sarah  T. 
Arron,  Michael  L.  Whitfield.  Identification  of  the  microbiome  as  a  potential  trigger  of  systemic 
sclerosis  by  metagenomic  RNA-sequencing  of  skin  biopsies.  American  College  of  Rheumatology 
Annual  Meeting,  2014 

2.  Zhenghui  Li,  Eleni  Marmarelis,  Kun  Qu,  Lionel  Brooks,  Patricia  A.  Pioli,  Howard  Y.  Chang,  Robert 
Lafyatis,  and  Michael  L.  Whitfield.  RNA-seq  and  miR-seq  analysis  of  SSc  skin  across  intrinsic  gene 
expression  subsets  shows  differential  expression  of  non-coding  RNAs  regulating  SSc  gene  expression. 
American  College  of  Rheumatology  Annual  Meeting,  2014 

Manuscripts: 

The  following  manuscripts  from  Dr.  Whitfield’s  lab  have  relevance  to  this  proposal.  Publication  5  directly 

derives  from  work  performed  to  accomplish  the  aims  of  this  proposal. 


17 


Principal  Investigator  Michael  L.  Whitfield,  Ph.D 

1.  Arron  ST*,  Dimon  MT,  Li  Z,  Johnson  ME,  Wood  T.,  Feeney  L,  Angeles  JG,  Lafyatis  R,  Whitfield 
ML*.  High  Rhodotorula  sequences  in  skin  transcriptome  of  patients  with  diffuse  systemic  sclerosis.  J. 
Invest  Derm.  2014,  Mar  7.  doi:  10.1038/jid.2014.127. 

2.  Johnson  ME,  Mahoney  JM,  Marmarelis  E,  Sargent  JR,  Wu  MR,  Spotts  K,  Hinchcliff  M,  Whitfield 
ML*.  Experimentally-derived  fibroblast  gene  signatures  identify  molecular  pathways  associated  with 
distinct  subsets  of  systemic  sclerosis  patients  in  three  independent  cohorts.  PLoS  One.  2015  Jan 
21;10(l):e01 14017.  doi:  10.1371/joumal.pone.0114017.  eCollection  2015. 

3.  Mahoney  JM,  Taroni  J,  Martyanov  V,  Wood  TA,  Greene  CS,  Pioli  PA,  Hinchcliff  M,  Whitfield  ML*. 
Systems  level  analysis  of  systemic  sclerosis  shows  a  network  of  immune  and  profibrotic  pathways 
connected  with  genetic  polymorphisms.  PLoS  Comput  Biol.  2015  Jan  8;ll(l):el004005.  doi: 
10.1371/joumal.pcbi.l004005.  eCollection  2015  Jan. 

4.  Taroni  JN,  Martyanov  V,  Wood  TA,  Choe  S,  Huang  CC,  Hirano  I,  Yang  GY,  Brenner  D,  Jung  B,  Cams 
M,  Podluski  S,  Chang  RW,  Varga  J,  Whitfield  ML,  Hinchcliff  M.  Genome -wide  gene  expression 
analysis  of  systemic  sclerosis  esophageal  biopsies  identifies  disease-specific  molecular  subsets.  Arthritis 
ResTher.  2015  Jul 29;17:194.  doi:  10.1 186/sl3075-015-0695-l 

5.  Michael  E.  Johnson,  Andrew  V.  Grassetti,  Jaclyn  N.  Taroni,  Shawn  M.  Lyons,  Devin  Schweppe,  Jessica 
K.  Gordon,  Robert  F.  Spiera,  Robert  Lafyatis,  Paul  J.  Anderson,  Scott  A.  Gerber,  Michael  L.  Whitfield. 
Stress  Granules  and  RNA  Processing  Bodies  are  Novel  Autoantibody  Targets  in  Systemic  Sclerosis. 
Arthritis  Res  Ther.  2016  Jan  22;18:27.  doi:  10.1186/sl3075-016-0914-4. 

6.  Sargent  JL,  Li  Z,  Aliprantis  AO,  Greenblatt  M,  Lemaire  R,  Wu  MH,  Wei  J,  Taroni  J,  Harris  A,  Long 
KB,  Burgwin  C,  Artlett  CM,  Blankenhom  EP,  Lafyatis  R,  Varga  J,  Clark  SH,  Whitfield  ML. 
Identification  of  Optimal  Mouse  Models  of  Systemic  Sclerosis  by  Interspecies  Comparative  Genomics. 
Arthritis  Rheumatol.  2016  Aug;68(8):2003-15.  doi:  10.1002/art.39658 

7.  Martyanov  V,  Whitfield  ML.  Molecular  stratification  and  precision  medicine  in  systemic  sclerosis  from 
genomic  and  proteomic  data.  Curr  Opin  Rheumatol.  2016  Jan;28(l):83-8.  doi: 
10.1097/BOR.0000000000000237.  Review. 

8.  Zhenghui  Li,  Guoshuai  Cai,  Michael  S.  Ball,  Kun  Qu,  Patricia  A.  Pioli,  Howard  Chang,  Sarah  Arron, 
Robert  Lafyatis,  and  Michael  L.  Whitfield.  Functional  Characterization  of  Systemic  Sclerosis 
Transcriptome  Identifies  a  Coding  Region  Polymorphism  more  Prevalent  in  Africans  that  affects  IL6 
Production.  In  preparation 

The  following  additional  papers  were  published  by  Drs.  Whitfield,  Lafyatis  and  Arron  during  the  funding 
period. 

1 .  Long  KB,  Li  Z,  Burgwin  C,  Cho  SG,  Martyanov  V,  Sassi-Gaha  S,  Earl  J,  Eutsey  R,  Ahmed  A,  Ehrlich 
GD,  Artlett  CM,  Whitfield  ML,  Blankenhom  EP  *.  The  Tsk2/+  mouse  fibrotic  phenotype  is  due  to  a 
gain-of-function  mutation  in  the  PIIINP  segment  of  the  ColSal  gene.  J.  Invest  Derm.  2014,  Oct  20.  doi: 
10.1038/jid.2014.455. 

2.  Iwamoto  N,  Vettori  S,  Maurer  B,  Brock  M,  Jiingel  A,  Calcagni  M,  Gay  RE,  Whitfield  ML,  Distler 
J.H.W,  Gay  S,  Distler  O*.  Downregulation  of  miR-193b  in  systemic  sclerosis  regulates  the  proliferative 
vasculopathy  by  urokinase-type  plasminogen  activator  expression.  Ann  Rheum  Pis.  2014  Nov  10.  pii: 
annrheumdis-2014-205326.  doi:  10.1136/annrheumdis-2014-205326.  [Epub  ahead  of  print] 

3.  Marangoni  RG,  Korman  B,  Wei  J,  Wood  TA,  Whitfield  ML,  Scherer  PE,  Tourtellotte  WG  and  Varga 
J*.  Myofibroblasts  in  Cutaneous  Fibrosis  Originate  from  Intradermal  Adipocytes.  Arthritis  Rheumatol. 
2015  Apr;67(4):  1062-73.  doi:  10.1002/art.38990. 

4.  Chakravarty  EF,  Martyanov  V,  Fiorentino  D,  Wood  TA,  Haddon  DJ,  Jarrell  JA,  Utz  PJ,  Genovese  MC, 
Whitfield  ML,  Chung  L.  A  Pilot  Randomized  Placebo-Controlled  study  of  Abatacept  for  the  Treatment 
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of  Diffuse  Cutaneous  Systemic  Sclerosis.  Arthritis  Research  &  Therapy,  Arthritis  Res  Ther.  2015  Jun 
13;17(1):159. 

5.  Fresolimumab  treatment  decreases  biomarkers  and  improves  clinical  symptoms  in  systemic  sclerosis 
patients.  Rice  LM,  Padilla  CM,  McLaughlin  SR,  Mathes  A,  Ziemek  J,  Goummih  S,  Nakerakanti  S, 

York  M,  Farina  G,  Whitfield  ML,  Spiera  RF,  Christmann  RB,  Gordon  JK,  Weinberg  J,  Simms  RW, 
Lafyatis  R.  J.  Clin.  Invest.  2015  Jun  22.  pii:  77958.  doi:  10.1 172/JCI77958 

6.  Lisa  M.  Rice,  Jessica  Ziemack,  Eric  Stratton,  Sarah  Mclauglin,  Cristina  Padilla,  Allison  Mathes,  Romy 
Christmann,  Giuseppina  Stifano,  Jeff  Browning,  Michael  L.  Whitfield,  Robert  Spiera,  Jessica  Gordon, 
Robert  Simms,  Yuqing  Zhang,  Robert  Lafyatis.  A  longitudinal  biomarker  for  the  extent  of  skin  disease 
in  patients  with  diffuse  cutaneous  systemic  sclerosis.  Arthritis  Rheumatol.  2015  Nov;67(l  1):3004-15. 
doi:  10.1002/art.39287. 

7.  Gordon  JK,  Martyanov  Y,  Wood  TA,  Spiera  RF,  Whitfield  ML.  Nilotinib  (Tasigna™)  in  the 
Treatment  of  Early  Diffuse  Systemic  Sclerosis:  An  Open-Label,  Pilot  Clinical  Trial.  Arthritis  Res  Ther. 
2015  Aug  18;17:213.  doi:  10.1186/sl3075-015-0721-3. 

8.  Brooks  L,  Lyons  SM,  Mahoney  JM,  Welch  JD,  LiuZ,  Marzluff  WF,  and  Whitfield  ML.  A  multi¬ 
protein  occupancy  map  of  the  histone  mRNP.  RNA.  2015  Nov;21(ll):  1943-65.  doi: 
10.1261/ma.053389.115.  Epub  2015  Sep  16. 

9.  Lyons  SM,  Cunningham  CH,  Welch  JD,  Groh  B,  Guo  AY,  Wei  B,  Whitfield  ML,  Xiong  Y,  Marzluff 
WF.  A  subset  of  replication-dependent  histone  mRNAs  are  expressed  as  polyadenylated  RNAs  in 
terminally  differentiated  tissues.  Nucleic  Acids  Res.  2016  Jul  8.  pii:  gkw620. 

Degrees  obtained  that  are  supported  by  this  award 

Dr.  Zhenghui  Li,  who  worked  on  the  microbiome  project,  will  complete  his  Ph.D  during  year  2  of  funding.  He 
has  received  direct  support  from  this  grant. 

Development  of  cell  lines,  tissue  or  serum  repositories 

None 

7.  PARTICIPANTS  &  OTHER  COLLABORATING  ORGANIZATIONS 

None 

8.  SPECIAL  REPORTING  REQUIREMENTS 

COLLABORATIVE  AWARDS:  For  collaborative  awards,  independent  reports  are  required  from  BOTH  the 
Initiating  PI  and  the  Collaborating/Partnering  PI.  A  duplicative  report  is  acceptable;  however,  tasks  shall  be 
clearly  marked  with  the  responsible  PI  and  research  site.  A  report  shall  be  submitted  to 
https://ers.amedd.army.mil  for  each  unique  award. 

An  identical  final  progress  report  will  be  sent  from  Dr.  Arron 
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Stress  granules  and  RNA  processing  bodies 
are  novel  autoantibody  targets  in  systemic 
sclerosis 

Michael  E.  Johnson1,  Andrew  V.  Grassetti1,  Jaclyn  N.  Taroni1,  Shawn  M.  Lyons2,  Devin  Schweppe1,  Jessica  K.  Gordon3, 
Robert  F.  Spiera3,  Robert  Lafyatis4,  Paul  J.  Anderson2,  Scott  A.  Gerber1  and  Michael  L.  Whitfield1,5* 


Abstract 

Background:  Autoantibody  profiles  represent  important  patient  stratification  markers  in  systemic  sclerosis  (SSc). 
Here,  we  performed  serum-immunoprecipitations  with  patient  antibodies  followed  by  mass  spectrometry  (LC-MS/MS) 
to  obtain  an  unbiased  view  of  all  possible  autoantibody  targets  and  their  associated  molecular  complexes  recognized 
by  SSc. 

Methods:  HeLa  whole  cell  lysates  were  immunoprecipitated  (IP)  using  sera  of  patients  with  SSc  clinically  positive 
for  autoantibodies  against  RNA  polymerase  III  (RNAP3),  topoisomerase  1  (TOPI),  and  centromere  proteins  (CENP).  IP 
eluates  were  then  analyzed  by  LC-MS/MS  to  identify  novel  proteins  and  complexes  targeted  in  SSc.  Target  proteins 
were  examined  using  a  functional  interaction  network  to  identify  major  macromolecular  complexes,  with  direct  targets 
validated  by  IP-Western  blots  and  immunofluorescence. 

Results:  A  wide  range  of  peptides  were  detected  across  patients  in  each  clinical  autoantibody  group.  Each  group 
contained  peptides  representing  a  broad  spectrum  of  proteins  in  large  macromolecular  complexes,  with  significant 
overlap  between  groups.  Network  analyses  revealed  significant  enrichment  for  proteins  in  RNA  processing  bodies 
(PB)  and  cytosolic  stress  granules  (SG)  across  all  SSc  subtypes,  which  were  confirmed  by  both  Western  blot  and 
immunofluorescence. 

Conclusions:  While  strong  reactivity  was  observed  against  major  SSc  autoantigens,  such  as  RNAP3  and  TOPI,  there 
was  overlap  between  groups  with  widespread  reactivity  seen  against  multiple  proteins.  Identification  of  PB  and  SG  as 
major  targets  of  the  humoral  immune  response  represents  a  novel  SSc  autoantigen  and  suggests  a  model  in  which  a 
combination  of  chronic  and  acute  cellular  stresses  result  in  aberrant  cell  death,  leading  to  autoantibody  generation 
directed  against  macromolecular  nucleic  acid-protein  complexes. 

Keywords:  Systemic  sclerosis,  Scleroderma,  Autoantibody,  RNA  processing  bodies,  Stress  granules 


Background 

Systemic  sclerosis  (SSc)  is  a  rare  systemic  autoimmune 
disease  of  unknown  etiology  characterized  by  skin 
fibrosis,  internal  organ  involvement,  vascular  abnor¬ 
malities,  and  autoantibody  production.  Patients  are 
broadly  classified  as  having  either  limited  (ISSc)  or  dif¬ 
fuse  (dSSc)  disease  based  primarily  upon  the  extent  of 
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skin  involvement  and  autoantibody  profiles.  While  a 
wide  array  of  autoantibodies  have  been  described  for 
SSc,  only  a  small  number  of  these  targets  are  used  for 
clinical  diagnosis  and  stratification.  Autoantibodies 
targeting  RNA  polymerase  III  (RNAP3),  topoisomer¬ 
ase  1  (TOPI;  commonly  referred  to  as  Scl70),  and 
centromere  proteins  (CENP)  represent  the  three  the 
most  common,  clinically  measured  autoantibodies  ob¬ 
served  in  SSc  [1,  2].  Other  autoantibodies,  including 
fibrillarin  (U3RNP),  Pm/Scl,  Ku,  U1RNP,  U11/U12, 
and  Th/To  have  also  been  described  [1,  3]  but  are  not 
routinely  measured  for  clinical  subtyping. 
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While  the  processes  underlying  autoantibody  produc¬ 
tion  in  SSc  remain  poorly  understood,  the  presence  of 
certain  autoantibodies  is  strongly  predictive  of  clinical 
outcomes  [1-3].  TOPI  and  RNAP3  autoantibodies  are 
almost  exclusively  seen  in  dSSc,  while  CENP,  Th/To,  and 
U1RNP  antibodies  are  more  commonly  associated  with 
ISSc  [1,  3].  U3RNP  autoantibodies  are  not  associated 
with  either  clinical  subset,  and  are  often  found  in  con¬ 
junction  with  other  autoantibodies,  including  both 
TOPI  and  CENP  [3].  Certain  antibodies,  such  as  TOPI 
and  Ull/12,  have  been  shown  to  be  predictive  of  poorer 
overall  prognosis,  including  increased  likelihood  of  pul¬ 
monary  fibrosis  [4]  and  cardiac  involvement,  while 
RNAP3  autoantibodies  have  recently  been  linked  to  co¬ 
occurrence  of  SSc  with  cancer  [5]. 

Despite  the  importance  of  autoantibodies  in  SSc,  the 
vast  majority  of  target  identification  and  phenotypic 
screening  has  been  performed  using  methods  targeting 
only  a  single  autoantibody,  with  little  ability  to  detect 
novel  or  low  abundance  autoantibodies.  Furthermore, 
these  methods  fail  to  address  the  possibility  of  co¬ 
occurrence  of  multiple  autoantibodies  within  a  patient, 
which  may  have  important  clinical  implications.  Autoan¬ 
tigen  microarrays  have  proven  successful  for  screening 
large  numbers  of  autoantibodies  in  parallel,  however  tar¬ 
get  identification  is  limited  to  those  antigens  produced 
and  printed  on  the  antigen  microarrays  [6].  To  address 
these  limitations,  we  performed  immunoprecipitations 
(IP)  of  HeLa  whole  cell  lysates  using  sera  from  RNAP3-, 
CENP-,  and  TOPI -positive  patients,  as  well  as  healthy 
controls,  followed  by  mass  spectrometry  (LC-MS/MS)  to 
provide  an  unbiased  assessment  of  all  autoantibodies 
present  in  these  SSc  patients.  This  method  provides  a 
better  view  of  the  full  range  of  autoantibodies  present  in 
SSc,  including  both  novel  and  established  targets,  and 
provides  insights  into  the  general  processes  underlying 
autoantibody  production. 

Methods 

Clinical  samples 

Patient  serum  was  obtained  from  Boston  University 
Medical  School,  Boston  (BUMC),  MA,  USA  and  the 
Hospital  for  Special  Surgery  (HSS),  New  York,  NY, 
USA.  All  relevant  study  protocols  were  approved  by 
the  Dartmouth  College  committee  for  the  protection 
of  human  subjects,  and  the  internal  review  boards  of 
both  BUMC  and  HSS.  Informed  consent  was  ob¬ 
tained  from  all  patients  prior  to  sample  collection. 
Patients  were  diagnosed  with  either  dSSc  or  limited 
SSc,  as  determined  using  the  1980  American  College 
of  Rheumatology  classification  criteria.  Detection  of 
major  autoantibody  reactivities  was  performed  using 
standard  clinical  assays. 


Human  cell  lysates 

HeLa  cells  were  cultured  in  DMEM  supplemented  with 
10  %  fetal  bovine  serum  (FBS)  (v/v)  and  100  IU/mL 
penicillin-streptomycin.  Cells  were  grown  to  approxi¬ 
mately  80  %  confluence,  harvested  in  IP  lysis  buffer 
(150  mM  NaCl,  50  mM  Tris  pH  7.5,  1  mM  MgCl2,  1  mM 
EDTA,  0.5  %  Triton  X-100,  2.5  mM  [3-mercaptoethanol, 
1  mM  sodium  molybdate,  1  mM  sodium  fluoride,  1  mM 
sodium  tartrate,  1  mM  dithiothreitol  (DTT),  and  protease 
inhibitors  (Roche,  Indianapolis,  IN,  USA)),  lysed  by  pas¬ 
sage  through  a  pre-chilled  high-gauge  syringe,  and  centri¬ 
fuged  for  15  minutes  to  pellet  debris.  Lysates  were  then 
clarified  by  incubating  for  4  h  at  4  °C  on  a  rotating  plat¬ 
form.  Protein  concentrations  were  quantified  using  a 
standard  bicinchoninic  acid  (BCA)  protein  assay  kit 
(Thermo  Scientific,  Waltham,  MA,  USA). 

Serum  immunoprecipitation 

Patient  serum  was  cross-linked  to  Protein  G  Dynabeads 
(Invitrogen,  St.  Louis,  MO,  USA)  prior  to  IP.  First, 
100  pL  serum  (approximately  1  mg  IgG)  was  added  to 
50  pL  Protein  G  beads  and  incubated  for  5  h  at  4  °C. 
Samples  were  then  washed  in  PBS,  equilibrated  in  cross- 
linking  buffer  (50  mM  HEPES,  pH  8.2),  and  cross-linked 
to  Protein  G  beads  by  the  addition  of  20  mM  dimethyl 
pimelimidate  and  300  mM  HEPES  (DMP)  solution  for 
10  minutes  at  room  temperature  (repeated  three  times). 
The  crosslinking  reaction  was  then  terminated  by  the 
addition  of  50  mM  ammonium  bicarbonate,  and  the 
resulting  antibody  bead  mixture  added  to  500  pL  cell 
lysate  (diluted  to  4  mg/mL  in  IP  lysis  buffer).  Samples 
were  incubated  overnight  at  4  °C  on  a  rotating  platform, 
washed  in  cold  IP  lysis  buffer,  and  eluted  in  a  buffer  con¬ 
taining  2  %  SDS,  75  mM  NaCl,  50  mM  Tris  pH  8.1,  and 
20  %  glycerol  at  65  °C  for  5  minutes.  Eluates  were  re¬ 
duced  by  the  addition  of  0.1  M  dithiothreitol  (DTT)  (to 
a  final  concentration  5  mM),  and  incubated  at  80  °C  for 
5  minutes.  Samples  were  then  resolved  by  SDS -PAGE, 
split  into  high  (>60  kDa)  and  low  (<60  kDa)  molecular 
weight  fractions  and  analyzed  by  mass  spectrometry. 

Mass  spectrometry 

Proteins  contained  in  Coomassie-stained  gel  regions 
were  digested  overnight  with  trypsin  (1:200  w/v)  at  37  ° 
C.  Following  digestion,  peptides  were  extracted  from  the 
gels,  dried,  and  analyzed  by  nanoscale  LC-MS/MS.  LC- 
MS/MS  analyses  were  performed  on  either  LTQ  Orbi- 
trap  Classic  or  Orbitrap  Fusion  LC-MS/MS  platforms. 
LTQ  Orbitrap  Classic  analyses  were  conducted  as  de¬ 
scribed  previously  [7]. 

For  Orbitrap  Fusion  analyses,  samples  were  loaded  onto 
an  EASY-nLC  1000  Liquid  Chromatograph  (Thermo  Sci¬ 
entific,  Waltham,  MA,  USA)  and  separated  by  reverse- 
phase  high  pressure  liquid  chromatography  (RP-HPLC) 
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using  an  approximately  36-cm  column  with  a  100-pM 
inner  diameter  packed  with  3  pm  120  A  C18  particles  (Dr. 
Maisch  GmbH,  Ammerbuch-Entringen,  Germany).  The 
resultant  peptide  eluate  was  directed  into  an  Orbitrap 
Fusion  Tribrid  Mass  Spectrometer  operating  in  a  data- 
dependent  sequencing  acquisition  mode  across  a  30- 
minute  reverse-phase  gradient  (6  %  acetonitrile,  0.1  % 
formic  acid  to  30  %  acetonitrile,  0.1  %  formic  acid)  at 
350  nL/min  flow  rate.  The  Orbitrap  Fusion  was  operated 
with  an  Orbitrap  MSI  scan  at  120  I<  resolution,  followed 
by  Orbitrap  MS2  scans  of  higher  energy  collision-induced 
dissociation  (HCD)  fragment  ions  (30  %  HCD  energy)  at 
15  I<  resolution  using  a  maximum  cycle  type  of  2  s, 
precursor  ion  dynamic  exclusion  window  of  15  s,  +2,  +3, 
and  +4  precursor  ions  selected  for  FC-MS/MS,  and  max¬ 
imum  ion  injection  times  of  100  ms  (MSI)  and  50  ms 
(MS2).  The  resulting  tandem  mass  spectra  were  data- 
searched  using  the  COMET  search  engine  [8]  against  a 
Homo  sapiens  proteome  database  (source:  Uniprot; 
download  date:  2  July  2013)  with  a  precursor  ion  tol¬ 
erance  of  +/-  1  Da  [9]  and  a  fragment  ion  tolerance 
of  0.02  Th.  Peptide  spectra  matches  (PSMs)  were  fil¬ 
tered  to  <1  %  false  discovery  rate  using  the  target 
decoy  strategy  [10],  and  reported. 

IP-western  blots 

Anti-UPFl  antibody  was  kindly  provided  by  Dr.  Fynne 
Maquat  (University  of  Rochester  Medical  Center,  Roch¬ 
ester,  NY,  USA).  Antibodies  to  MOV10  and  CAPRIN1 
were  purchased  from  Proteintech  (Chicago,  IF,  USA); 
antibodies  to  G3BP1  and  USP10  were  purchased  from 
Santa  Cruz  Biotechnology  (Santa  Cruz,  CA,  USA). 
Serum  immunoprecipitation  of  HeFa  lysates  was  per¬ 
formed  as  described  above;  50  %  of  each  eluate  (15  pF) 
was  then  run  on  a  10  %  bis-tris  precast  gel  (Fife  Tech¬ 
nologies,  Carlsbad,  CA,  USA).  HeFa  whole  cell  lysate 
(100  pg)  was  used  as  a  positive  control;  no  loading  con¬ 
trol  was  performed  due  to  the  absence  of  viable  targets 
present  in  all  IP  eluates.  Western  blots  were  then  run 
following  standard  protocols,  and  visualized  using  West¬ 
ern  Fightning  ECF  Pro  or  Ultra  substrate  (Perkin  Elmer 
Inc.,  Waltham,  MA,  USA),  as  necessary. 

Data  analysis 

Non-redundant  peptide  hits,  defined  as  mass  spectra 
mapping  exclusively  to  a  given  peptide  fragment,  were 
used  for  all  downstream  analyses.  Pairwise  comparisons 
between  samples  were  performed  by  Fishers  exact  test 
using  the  Bonferroni  correction  for  multiple  hypothesis 
testing.  Venn  diagrams  were  generated  using  VENNY 
[11].  Network  analysis  was  performed  using  the  Genome- 
scale  Integrated  Analysis  of  gene  Networks  in  Tissues 
(GIANT;  http://giant.princeton.edu/)  global  network  [12] 
and  visualized  using  Cytoscape  [13].  Communities  in  the 


network  were  detected  using  fast-greedy  modularity  as 
implemented  in  igraph.  Functional  annotation  of  individ¬ 
ual  communities  was  performed  using  g:Profiler  [14]. 
Semiquantitative  enrichment  of  SSc-associated  autoanti¬ 
bodies  was  determined  using  a  binary  assessment  of  auto¬ 
antibody  presence  or  absence  in  a  sample.  Preferential 
enrichment  in  SSc  was  defined  as  all  proteins  de¬ 
tected  in  >50  %  of  all  patient  samples  at  a  frequency  >  1.5- 
fold  relative  to  controls.  Enrichment  of  biological  processes 
and  cellular  components  was  determined  using  g:Profiler 
using  the  g:SCS  threshold  correction  for  multiple  hypoth¬ 
esis  testing  and  a  functional  category  size  <500  genes. 
Hierarchical  clustering  was  performed  using  Cluster  3.0 
[15],  and  visualized  using  Java  TreeView  [16]. 

Immunofluorescence 

The  day  prior  to  the  experiment,  105  U20S  cells  were 
seeded  onto  11  mm  glass  coverslips  and  allowed  to  at¬ 
tach  overnight  at  37  °C/5  %  C02  in  DMEM  containing 
10  %  FBS  (Gibco).  Cells  were  treated  with  100  pM  so¬ 
dium  (meta)arsenite  (Sigma  Aldrich)  for  1  h  to  induce 
the  formation  of  stress  granules  and  then  with  4  %  para¬ 
formaldehyde  solution  at  room  temperature  for  15  - 
minutes  followed  by  blocking  and  permeabilization  with 
5  %  normal  horse  serum,  0.1  %  digitonin  in  Tris- 
buffered  saline.  Staining  was  performed  with  anti-eIF3b 
(Santa  Cruz),  anti-SKl-Hedls  (Santa  Cruz),  and  patient 
sera  for  1  h  at  room  temperature.  Secondary  antibodies 
(anti-goat-Cy3,  anti-mouse-Cy2,  and  anti-human-Cy5) 
were  purchased  from  Jackson  Faboratories  and  incubated 
at  room  temperature  for  1  h.  Conventional  fluorescence 
microscopy  was  performed  using  a  microscope  (model 
Elipse  E800,  Nikon,  Tokyo,  Japan)  with  epifluorescence 
optics  with  a  digital  camera  (model  CCD-SPOT  RT;  Diag¬ 
nostic  Instruments,  Sterling  Heights,  MI).  Images  were 
compiled  using  Adobe  Photoshop  software  (CS6;  Adobe 
Systems,  San  Jose,  CA). 

Results 

Identification  of  proteins  cross-reacting  to  serum  antibodies 

Immunoprecipitations  (IP)  of  HeFa  whole  cell  lysates 
were  performed  using  sera  obtained  from  13  SSc  pa¬ 
tients  and  4  healthy  controls.  HeFa  cells  were  chosen 
based  upon  their  consistent,  high  level  of  expression  of  a 
broad  range  of  proteins  from  the  human  genome  [17]. 

SSc  patients  were  divided  into  three  groups,  TOPI, 
RNAP3,  and  CENP,  as  measured  in  a  reference  labora¬ 
tory;  clinical  data  for  each  patient  are  shown  in  Table  1. 
These  groups  were  chosen  based  upon  their  relative 
frequency,  and  their  importance  in  clinical  diagnosis. 
Immunoprecipitated  proteins  were  analyzed  by  FC-MS/ 
MS,  and  the  resulting  spectra  aligned  to  the  reference 
human  proteome  (UCSC  version  hgl9).  Data  are  pre¬ 
sented  in  two  ways;  first  to  identify  the  total  number  of 
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Table  1  Clinical  information  for  patients  involved  in  this  study 


Sample 

Group 

Age  (years) 

Sex 

Race 

Disease  type 

IFD/PAH 

Disease  duration  (years) 

ANA  pattern 

ANA  titer 

MRSS 

SSc  1 

TOPI 

36 

F 

White 

Diffuse 

mild  ILD 

2.5 

1:320 

43 

SSc  132 

TOPI 

49 

F 

White 

Diffuse 

No 

Homogeneous 

1:640 

27 

SSc  218 

TOPI 

55 

F 

White 

Diffuse 

IFD 

Homogeneous/nucleolar 

1:2560 

18 

SSc  208 

TOPI 

64 

M 

White 

Diffuse 

No 

Nucleolar 

1:1280 

37 

SSc  5 

RNAP3 

53 

M 

White 

Diffuse 

No 

0.75 

Speckled 

1:80 

36 

SSc  7 

RNAP3 

45 

F 

Black 

Diffuse 

No 

0.5 

Speckled 

1:80 

27 

SSc  10 

RNAP3 

52 

M 

White 

Diffuse 

No 

0.5 

0 

22 

SSc  18 

RNAP3 

69 

F 

White 

Diffuse 

IFD 

0.5 

Nucleolar 

1:160 

44 

SSc  159 

CENP 

54 

F 

Mixed 

limited 

No 

7 

Centromere 

1:1280 

2 

SSc  177 

CENP 

64 

F 

White 

limited 

No 

15 

Discrete  speckled 

4+ 

SSc  194 

CENP 

66 

F 

White 

limited 

No 

18 

Discrete  speckled 

4+ 

6 

SSc  238 

CENP 

53 

F 

White 

limited 

No 

6 

Centromere 

1:640 

5 

SSc  226 

CENP 

55 

F 

Asian 

Diffuse 

No 

Centromere 

1:1280 

6 

HC  162 

Control 

24 

M 

White 

HC  400 

Control 

21 

M 

White 

HC  117 

Control 

M 

HC  118 

Control 

M 

Blank  cells  indicate  information  not  available  at  the  time  of  sample  collection.  ILD  interstitial  lung  disease,  PAH  pulmonary  arterial  hypertension,  ANA  anti-nuclear 
antibody,  MRSS  modified  Rodnan  skin  score,  SSc  systemic  sclerosis,  M  male,  F  female 


peptides  that  could  be  aligned  to  each  protein  (total 
hits),  and  second  to  identify  all  non-redundant  peptides 
that  mapped  exclusively  to  a  given  protein  (non-redun- 
dant  hits).  A  complete  list  of  all  data  can  be  found  in 
Additional  file  1:  Table  SI. 

Exclusivity  and  co-occurrence  of  SSc  autoantibodies 

We  observed  a  high  degree  of  reproducibility  between 
patients  within  their  respective  autoantibody  groups 
(TOPI,  RNAP3,  and  CENP;  Fig.  1).  The  greatest  degree  of 
overlap  between  peptides  was  observed  among  RNAP3 
patients  (Fig.  la  and  c),  with  420  proteins  (54.1  %)  de¬ 
tected  in  all  four  patients  (Fig.  lc).  The  remaining  groups 
exhibited  significant  overlap  in  three  of  four  (TOPI)  and 
four  of  five  (CENP)  patients,  respectively  (Fig.  la),  along 
with  a  single  outlier  that  showed  either  higher  (SSc  208; 
TOPI)  or  lower  (SSc  226;  CENP)  total  peptide  hits  relative 
to  other  samples  in  these  groups.  Within  TOPI,  111  pro¬ 
teins  (14.2  %)  were  detected  in  all  four  patients  (Fig.  Id), 
while  CENP  exhibited  48  proteins  (10.5  %)  common  to  all 
patients  (Fig.  lc).  The  least  overlap  was  seen  in  healthy 
controls,  with  only  40  proteins  (7.6  %;  Fig.  lb)  common 
across  individuals. 

Across  all  samples,  283  proteins  (25.0  %)  were  de¬ 
tected  in  at  least  one  patient  in  each  of  the  four  auto¬ 
antibody  groups  (Fig.  le,  Additional  file  2:  Table  S2). 
Some  of  these  proteins  likely  represent  background 
signals  (serum  albumin  (AFB),  (3-tubulin  (TUBB),  and 
ribosomal  proteins),  while  others  are  considered  specific 


to  SSc  despite  trace  level  detection  in  controls.  For  ex¬ 
ample,  multiple  SSc  autoantibody  targets,  including  Ku 
(XRCC5  and  XRCC6),  Ro52/TRIM21,  and  nucleophos- 
min/B23  (NPM1)  were  present  in  this  set  of  proteins.  In 
contrast,  87  proteins  (7.7  %)  were  detected  in  all  three 
SSc  groups,  but  were  absent  in  controls  (Fig.  le; 
Additional  file  2:  Table  S2).  Functional  analyses  of  these 
proteins  revealed  strong  enrichment  of  proteins  involved 
in  oxidative  stress  responses  and  nucleic  acid  processing 
(Additional  file  3:  Table  S3B). 

Of  the  1,130  non-redundant  proteins  identified,  473 
(41.8  %)  were  unique  to  a  given  autoantibody  group 
(Fig.  If);  however,  the  vast  majority  of  these  proteins 
were  exclusive  to  a  single  patient,  with  only  111  (23.5  %) 
detected  in  two  or  more  patients.  These  results  suggest  a 
wide  range  of  autoantibody  responses  within  each  of  the 
clinical  autoantibody  groups  beyond  what  has  already 
been  described. 

Among  the  major  autoantibody  groups,  immunopre- 
cipitation  of  RNAP3  was  exclusive  to  the  RNAP3  group, 
with  no  RNAP3  peptides  detected  in  any  of  the  other 
samples  (Table  2).  In  contrast,  TOPI  peptides  were 
consistently  highest  among  TOPI  patients,  but  were  also 
detected  at  low  levels  in  all  four  RNAP3+  patients,  and 
in  two  controls  (Table  2).  As  these  patients  were 
negative  for  TOPI  autoantibodies  by  clinical  testing, 
these  results  indicate  a  higher  degree  of  sensitivity  for 
our  IP/MS  protocol  compared  to  standard  EFISA-based 
methods  used  clinically.  In  contrast,  CENP  was  only 
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Fig.  1  Overview  of  mass  spectrometry  results,  a  Correlation  matrix  of  non-redundant  protein  hits  for  all  patients  and  controls.  Comparisons  were 
performed  using  Fisher's  exact  test  with  the  Bonferroni  correction.  Black  boxes  indicate  intra-group  comparisons  for  each  of  the  four  clinically 
defined  groups.  Green  controls;  red  RNA  polymerase  III  ( RNAP3 );  blue  centromere  protein  (CCA/P);  yellow  topoisomerase  I  {TOPI),  b-f  Venn 
diagrams  depict  overlap  in  non-redundant  peptide  hits  within  and  between  groups,  b  healthy  controls,  c  RNAP3,  d  CENP,  e  TOPI,  and  f 
overlap  between  groups 


detected  at  low  levels  in  the  CENP  group,  likely  because 
it  remained  bound  to  the  tightly  packed  centromere 
complex  of  chromatin. 

Other  known  SSc  autoantigens  were  also  detected. 
RuvBL  [18]  was  strongly  detected  in  all  SSc  samples,  while 
virtually  absent  in  controls.  Ku  and  Su,  along  with  a  wide 
array  of  anti-tRNA  synthetases  [19]  were  routinely  detected 
in  both  the  RNAP3  and  TOPI  subsets,  but  were  only 
weakly  present  in  the  CENP  and  control  groups  (Table  2). 

Several  autoantigens  previously  implicated  in  SSc  were 
found  at  low,  background  levels  in  both  SSc  and  control 
samples.  Ro52/TRIM21  [20]  and  nucleophosmin/B23 
[21]  were  widely  detected  across  all  four  groups,  suggest¬ 
ing  widespread  reactivity  to  these  proteins  in  SSc  and 
controls.  We  did  not  find  evidence  in  SSc  of  enrichment 
of  Pm/Scl  autoantibodies,  which  target  exosome  compo¬ 
nents  EXOSC1-10  [22].  Peptides  for  these  proteins  were 
absent  in  the  CENP  group,  but  were  detected  at  low 
levels  in  other  subsets,  including  controls.  Autoantigens 
not  detected  here  include  many  of  the  URNPs,  PDGFR, 
matrix  metalloproteinases,  tissue  plasminogen  activator, 
and  vascular  receptor  antibodies  (Table  2). 


Functional  clustering  of  identified  proteins 

To  identify  functional  interactions  among  autoantigens, 
all  763  non-redundant  protein  hits  were  submitted  as  a 
query  to  the  GIANT  global  average  network.  This  ap¬ 
proach  included  both  SSc-specific  targets  and  those  de¬ 
tected  at  background  levels  in  controls,  to  better 
understand  the  full  range  of  autoreactive  proteins  and 
complexes.  Nine  distinct  communities  were  identified 
within  the  resulting  network,  in  which  each  gene  is  rep¬ 
resented  by  a  node,  and  two  genes  share  an  edge  if  they 
are  predicted  to  functionally  interact  (Additional  file  4: 
Figure  SI).  Analysis  of  each  of  these  communities  by 
g:Profiler  revealed  functional  enrichment  for  a  wide 
range  of  biological  processes  associated  with  important 
disease  processes  and  components  (Additional  file  4: 
Figure  SI).  Community  1  is  dominated  by  ribosomal 
proteins,  eukaryotic  initiation  factor  3  (eIF3)  subunits, 
and  includes  the  SSc  autoantibody  target  nucleophos- 
min/B23.  Communities  2  and  8  show  strong  enrichment 
for  Gene  Ontology  (GO)  terms  mRNA  processing,  ribo- 
nucleoprotein  complex,  and  cytosolic  stress  granule. 
Community  2  is  dominated  primarily  by  DEAD  box 


Table  2  SSc-associated  autoantibodies  observed  in  this  study 


Alias 

Associated  proteins 

Disease  subset 

Clinical  associations 

Major  autoantibodies 

RNA  Pol  III 

POFR3A 

dSSc 

Renal  crisis,  cancer 

Scl70 

TOPI 

dSSc 

Poor  prognosis,  internal  organ 
involvement,  and  proteinuria 

Centromere 

CENPB,  CENPH 

ISSc/CREST 

PAH,  ILD 

Other  SSc  autoantibodies  present  in  our  dataset 

Endothelial  Cell 

TUBB,  VCL,  FMNA,  RPFPO 

SSc 

PAH 

Fibroblast 

ENOI,  G6PD,  HSPA1  A, 
HSPA1B,  VIM 

SSc 

PAH 

Histone 

HI FX,  HIST1H1B,  HIST1H4A 

SSc 

PF,  internal  organ  involvement, 
decreased  survival 

B23 

NPM1 

dSSc,  CENP-  ISSc 

PAH 

Ku 

XRCC5,  XRCC6 

ISSc 

Myositis 

Su 

AG02 

SSc,  PM/Scl 

Unknown 

Mitochondrial  (M2) 

DLD,  PDHB 

ISSc 

Strong  association  with  primary 
biliary  cirrhosis 

Pm/Scl 

EXOSC1-10 

SSc 

PF,  digital  ulcers;  decreased  risk 
of  PAH  and  Gl  symptoms 

hnRNPs 

HNRNPA1-3,  HNRNPF 

SSc 

Common  in  SARDs 

U1 

SNRNPA,  SPRNP70 

SSc 

Co-occurrence  with  SS-A/SS-B, 
PAH,  overlap  syndrome 

U5 

SNRNP200 

SSc,  PM/Scl 

Unknown 

R052/TRIM21 

TRIM21 

SSc 

IFD,  other  autoimmune  diseases 

RuvB 

RUVBL1,  RUVBF2 

dSSc 

Common  in  SARDs,  older  age  at 
onset,  male  sex 

Annexin  V 

ANXA5 

dSSc,  CENP-  ISSc 

Digital  ischemia 

SS-B/LA 

SS-A,  SS-B 

SSc 

IFD,  other  autoimmune  diseases 

Peroxiredoxin 

PRDX1 

SSc 

Disease  duration,  PF,  cardiac 
involvement,  TOPI  +  patients 

hUBF/NOP90 

UBTF 

ISSc 

Mild  organ  involvement, 
favorable  prognosis 

Th^o 

POP1 

ISSc 

PF,  renal  crisis,  poor  prognosis, 
myositis,  PAH 

Prevalence  in  this  dataset  (avg/freq)  Reference 


Control  (n  =  4) 

RNAP3  (n  =  4) 

TOPI  (n  =  4) 

CENP  (n  =  5) 

- 

+++  (28/4) 

- 

- 

Graf,  et  al.  2012  [1];  Mehra, 
et  al.  2013  [3] 

+  (3/2) 

+  (4/4) 

+++  (19/4) 

- 

Mehra,  et  al.  2013  [3] 

- 

- 

- 

+  (1/2) 

Mehra,  et  al.  2013  [3] 

+  (1/D 

++  (6/4) 

+  (4/2) 

+  (0/1) 

Dib,  et  al.  2012  [34],  Naniwa, 
et  al.  2007  [35] 

+  (3/4) 

+++  (12/4) 

++  (5/3) 

++  (8/5) 

Terrier,  et  al.  2008  [36], 
Terrier,  et  al.  2009  [37] 

+  0/1) 

+  (3/3) 

+  (1/D 

- 

Mehra,  et  al.  2013  [3] 

+  (4/4) 

++  (7/4) 

++  (5/4) 

++  (6/5) 

Mehra,  et  al.  2013  [3] 

+  (3/3) 

+++  (12/4) 

++  (8/4) 

+  (2/3) 

Graf,  et  al.  2012  [1];  Mehra, 
et  al.  2013  [3] 

- 

+  (1/2) 

+  (3/1) 

- 

Satoh,  etal.  2013  [38] 

+  0/1) 

+  (2/3) 

+  (1/D 

- 

Mehra,  et  al.  2013  [13] 

+  (2/2) 

++  (5/3) 

+  (2/2) 

- 

Mehra,  etal.  2013  [13] 

+  (0/1) 

++  (7/4) 

+  (3/4) 

+  (2/4) 

Siapka,  et  al.  2007  [39] 

- 

+  (2/4) 

+  d/2) 

+  (0/1) 

Graf  et  al.  2012  [1];  Mehra, 
etal.  2013  [3] 

++  (6/3) 

++  (9/4) 

++  (8/3) 

+  (1/2) 

Kubo,  et  al.  2002  [40] 

++  (6/3) 

+++  (12/4) 

++  (6/4) 

++  (8/4) 

Mehra,  et  al.  2013  [3] 

+  0/1) 

++  (7/4) 

+  (3/4) 

+  (2/4) 

Kaji,  etal.  201 4  [18] 

+  (2/2) 

++  (7/4) 

+  (4/3) 

+  (3/4) 

Mehra,  et  al.  2013  [3] 

- 

+  (3/4) 

+  (2/2) 

+  (0/1) 

Mehra,  etal.  2013  [3] 

+  (2/4) 

++  (8/4) 

+  (3/3) 

+  (4/4) 

Mehra,  etal.  2013  [3] 

- 

+  (1/2) 

- 

- 

Mehra,  etal.  2013  [3] 

+  (1/2) 

+  0/1) 

+  (3/3) 

_ 

Graf  et  al.  2012  [1];  Mehra, 

et  al.  2013  [3] 
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Table  2  SSc-associated  autoantibodies  observed  in  this  study  (Continued) 


PL-12 

AARS 

SSc,  PM/DM 

ILD  without  myositis 

- 

- 

+  (1/1) 

+  0/1) 

Hamaguchi,  et  al.  2013  [19] 

OJ 

IARS 

SSc,  PM/DM 

ILD  without  myositis 

+  0/1) 

- 

+  (3/3) 

- 

Hamaguchi,  et  al.  2013  [19] 

EJ 

GARS 

SSc,  PM/DM 

ILD,  myositis 

- 

+  (3/4) 

+  (2/2) 

+  (0/1) 

Hamaguchi,  et  al.  2013  [19] 

Jo-1 

HARS 

SSc,  PM/DM 

ILD,  myositis 

- 

+  0/4) 

- 

- 

Hamaguchi,  et  al.  2013  [19] 

PL-7 

TARS 

SSc,  PM/DM 

ILD,  myositis 

+  (2/2) 

++  (8/4) 

++  (6/4) 

+  (2/4) 

Hamaguchi,  et  al.  2013  [19] 

Ha 

YARS 

SSc,  PM/DM 

Interstitial  pneumonia 

- 

+  (0/1) 

- 

- 

Hashish,  et  al  2005  [41] 

Zo 

FARSA,  FARSB 

SSc,  PM/Scl 

Anti-synthetase  syndrome 

- 

+  (2/4) 

- 

- 

Betteridge,  et  al.  2007  [42] 

SSc  autoantibodies  not  detected  in  our  dataset 

Fibrillarin 

U3RNP 

dSSc 

More  frequent  in  blacks;  severe 
disease,  poor  prognosis 

- 

- 

- 

- 

Mehra,  et  al.  2013  [3] 

U11/U12  RNP 

SNRNP35 

SSc 

Lung  fibrosis,  gastrointestinal 
involvement 

- 

- 

- 

- 

Mimori,  1999  [43] 

PDGFR 

PDGFR 

SSc 

Unknown 

- 

- 

- 

- 

Svegliati  Baroni,  et  al. 

2006  [44] 

MMP 

MMP  family 

dSSc 

Skin,  lung,  and  vascular  fibrosis 

- 

- 

- 

- 

Mehra,  et  al.  2013  [3] 

tPA 

PLAT 

ISSc 

PAH 

- 

- 

- 

- 

Mehra,  et  al.  2013  [3] 

1  F1 1 6 

IFI16 

ISSc 

Common  in  SARDs 

- 

- 

- 

- 

Mehra,  et  al.  2013  [3] 

Fibrillin  1 

FBN1 

dSSc 

Choctaw  and  Japanese  patients; 
absent  in  Caucasians 

- 

- 

- 

- 

Mehra,  et  al.  2013  [3] 

Vascular  Receptors 

AGTR2,  EDN1 

SSc 

TOP1+  patients,  renal  crisis 

- 

- 

- 

- 

Mehra,  et  al.  2013  [3] 

ATF2 

ATF2 

SSc 

Longer  disease  duration, 
decreased  lung  function 

- 

- 

- 

- 

Mehra,  et  al.  2013  [3] 

Data  are  presented  as  the  average  of  all  peptide  hits  across  each  autoantibody  group,  followed  by  the  frequency  of  peptide  detection  within  the  group.  For  autoantibodies  known  to  target  more  than  one  protein  or 
subunit,  data  for  a  single  representative  protein  are  shown,  with  the  specific  protein  highlighted  in  bold.  Associated  proteins  indicate  specific  protein  targets  identified  in  this  study;  among  autoantibodies  not 
identified  here,  the  most  common  targets  are  listed.  Symbols:  -,  +,  ++,  and  +++  indicate  an  average  of  0,  1-4,  5-9,  and  >10  peptide  hits  per  group,  respectively 

SSc  systemic  sclerosis,  ISSc  limited  cutaneous  SSc,  dSSc  diffuse  cutaneous  SSc,  PAH  pulmonary  arterial  hypertension;  ILD  interstitial  lung  disease;  CREST  CREST  syndrome  (calcinosis,  Raynaud  phenomenon,  esophageal 
dysmotility,  sclerodactyly,  and  telangiectasia);  PM/Scl  polymyositis/scleroderma,  PM/DM  polymyositis/dermatomyositis 
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helicases  proteins,  while  community  8  contains  a  diverse 
array  of  proteins  including  multiple  SSc  autoantibodies, 
including  TOPI,  SSB,  Pm/Scl  proteins,  URNPs,  and 
HNRNPs,  and  numerous  serine/arginine-rich  splicing 
factors.  Community  3  consists  primarily  of  aminoacyl 
tRNA  synthetases,  a  cluster  often  targeted  in  auto¬ 
immune  diseases  [19,  23].  Communities  4,  5,  and  9  are 
strongly  associated  with  a  variety  of  GO  processes 
known  to  play  a  major  role  in  SSc,  including  wound 
healing,  IFN  signaling,  and  response  to  oxidative  stress. 
Major  proteins  include  CD44,  HLAs,  myosins,  and  fila- 
min  proteins  in  community  4  and  tricarboxylic  acid 
cycle  proteins  in  community  5.  Community  9  contains 
multiple  protein  disulfide  isomerases  and  peroxiredox- 
ins,  protein  folding  enzymes  such  as  calnexin  (CANX) 
and  calreticulin  (CALR),  and  the  major  collagen  process¬ 
ing  enzyme  prolyl  4-hydroxylase  beta  (P4HB).  Commu¬ 
nity  6  contains  multiple  annexin  and  14-3-3  proteins; 
enriched  GO  processes  include  ribonucleoprotein  com¬ 
plex  assembly,  mitochondrial  transport,  RNA  processing, 
and  anchoring  junction.  Community  7  associated  with 
GO  terms  include  cell  cycle,  RNA  polymerase  III  com¬ 
plex,  DNA-PK-Ku  complex,  and  antigen  processing  and 
presentation.  Community  7  includes  several  SSc  auto¬ 
antibody  targets  including  Ku  proteins  XRCC5  and  6, 
RUVBL1  and  2,  RNA  polymerase  I  and  II  subunits,  mul¬ 
tiple  proteasomal  subunits,  and  T-complex  proteins. 

Preferential  detection  of  autoantibodies  in  SSc 

Subsequent  comparisons  between  groups  were  performed 
in  a  semiquantitative  manner  based  on  the  presence  or  ab¬ 
sence  of  a  given  protein  in  an  immunoprecipitant,  with 
quantitative  analyses  limited  to  comparisons  within  an 
individual  sample.  To  identify  biological  processes  and 
cellular  components  differentially  targeted  in  SSc,  with 
minimal  to  no  background  detection  in  controls,  we  ex¬ 
amined  all  proteins  detected  in  >50  %  of  SSc  samples  at  a 
frequency  >  1.5-fold  relative  to  controls,  resulting  in  a  list 
of  137  differentially  detected  proteins  (Fig.  2;  Additional 
file  2:  Table  S2).  Enriched  biological  processes  included 
ncRNA  metabolic  process ,  response  to  oxygen  radical,  and 
triglyceride-rich  lipoprotein  particle  remodeling.  Preferen¬ 
tially  targeted  cellular  components  include  cytosolic  stress 
granule,  lipid-protein  complex,  pigment  granule,  and  an¬ 
choring  junction;  molecular  functions  include  antioxidant 
activity  and  mRNA  binding  (Additional  file  3:  Table  S3C). 

RNA  processing  centers  are  major  targets  of  SSc 
autoantibodies 

The  strong  enrichment  for  GO  terms  associated  with 
mRNA  processing  and  stress  response,  as  well  as  the 
identification  of  cytosolic  stress  granule  as  an  enriched 
cellular  component,  led  us  to  further  investigate  the  role 
of  stress  granules  (SG)  and  RNA  processing  bodies  (PB) 


in  the  autoantibody  response  of  SSc.  SG  and  PB  repre¬ 
sent  distinct,  non-membranous  cytoplasmic  entities, 
which  arise  in  response  to  different  cellular  stresses 
including  oxidative  stress,  hypoxia,  viral  infection,  un¬ 
folded  proteins,  and  amino  acid  deprivation  [24]. 
These  structures  exist  in  constant  flux,  driven  by  the 
availability  of  constituent  mRNPs,  regulating  the  fate 
of  untranslated  mRNAs  in  response  to  translational 
arrest  [25].  While  SG  are  generally  absent  under  nor¬ 
mal  conditions,  PB  are  constitutively  present  at  low 
levels  due  to  their  role  as  microRNA  processing  cen¬ 
ters.  Both  structures  have  been  shown  to  arise  in  re¬ 
sponse  to  cellular  stresses,  including  oxidative  stress, 
ischemia,  and  cancer  [26],  all  of  which  are  known  to 
be  important  in  SSc  pathogenesis  [5,  27]. 

In  addition  to  the  137  differentially  detected  proteins  de¬ 
scribed  above,  a  wide  range  of  PB/SG  constituents  were 
readily  detected  across  most  SSc  samples  (Additional  file  5: 
Table  S4).  Substantial  reactivity  was  seen  against  PB  com¬ 
ponents  such  as  UPF1  and  MOVIO,  and  SG  proteins  FXR1 
and  FXR2,  G3BP1  and  G3BP2,  and  USP10.  Only  back¬ 
ground  levels  of  reactivity  were  seen  in  healthy  controls. 

Validation  of  PB/SG  antibodies  in  SSc 

In  order  to  validate  the  differential  abundance  of  PB/SG 
proteins  identified  by  LC-MS/MS,  HeLa  whole  cell  ly¬ 
sates  were  immunoprecipitated  using  antibodies  from 
each  patient  as  described  in  the  LC-MS/MS  analyses. 
Western  blots  were  performed  by  resolving  equal  vol¬ 
umes  of  IP  eluates  by  SDS-PAGE  and  transferring  to 
nitrocellulose.  Blots  were  then  probed  with  antibodies 
targeting  PB/SG  proteins  UPF1,  MOVIO,  CAPRIN1, 
G3BP1,  and  USP10.  These  targets  were  chosen  based 
upon  their  high  level  of  detection  across  SSc  groups,  as 
determined  based  on  our  LC-MS/MS  data.  Strong  re¬ 
activity  was  seen  against  all  five  proteins  in  SSc  with 
only  trace  levels  detected  in  controls  (Fig.  3a),  indicating 
widespread  immune  responses  against  these  protein 
complexes. 

Further  validation  was  performed  using  immunofluor¬ 
escence  (IF)  staining  of  U20S  cells  maintained  under 
conditions  of  oxidative  stress  to  induce  PB/SG  forma¬ 
tion.  Cells  were  probed  with  patient  sera  in  combination 
with  PB  and  SG  markers  SKl-Hedls  and  eIF3b,  respect¬ 
ively.  Co-localization  between  patient  sera  and  PB/SG 
markers  was  observed  in  six  of  nine  SSc  patients,  with  at 
least  one  positive  sample  in  each  of  the  three  autoanti¬ 
body  groups;  no  staining  was  seen  for  any  of  the  three 
healthy  controls  (Fig.  3b).  These  results  are  consistent 
with  that  seen  on  LC-MS/MS,  particularly  among 
RNAP3  patients,  who  exhibited  the  strongest  and  most 
consistent  autoantibody  responses  across  both  methods. 
Taken  together,  these  data  strongly  implicate  PB/SG  as 
novel  targets  of  SSc  autoantibody  responses. 
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Fig.  2  Proteins  differentially  detected  in  systemic  sclerosis  (SSc).  Semiquantitative  enrichment  of  SSc-associated  autoantibodies  was  determined 
using  a  binary  assessment  of  autoantibody  presence  or  absence  in  a  sample.  Preferential  enrichment  in  SSc  was  defined  as  all  proteins  detected 
in  >50  %  of  all  patient  samples  at  a  frequency  >1. 5-fold  relative  to  controls,  a  Heat  map  of  proteins  differentially  detected  in  SSc.  b  Network 
analysis  of  differentially  detected  proteins.  Community  detection  was  performed  using  the  GIANT  global  network;  functional  annotation  was 
performed  using  gProfiler 


Discussion 

Autoantibodies  have  long  been  used  in  the  diagnosis  of 
SSc,  with  different  autoantibodies  predictive  of  clinical 
outcomes,  including  interstitial  lung  disease,  pulmonary 
arterial  hypertension,  and  skin  involvement.  While  a 
wide  array  of  SSc-associated  autoantibodies  have  been 
described,  diagnoses  are  often  performed  based  upon 
the  presence  or  absence  of  reactivity  against  three  pro¬ 
teins:  RNAP3,  TOPI,  and  CENP.  The  data  presented 
here  suggest  a  much  broader  autoantibody  response, 
which  is  reflective  of  underlying  disease  pathologies. 
Strong  subset-specific  reactivity  was  evident  against  both 
RNAP3  and  TOPI,  with  no  RNAP3  peptides  detected  in 
any  of  the  other  groups;  however,  all  four  RNAP3  pa¬ 
tients  exhibited  modest  reactivity  against  TOPI,  indicat¬ 
ing  a  degree  of  overlap  between  these  two  autoantibody 
groups.  When  peptides  recovered  are  extended  beyond 
the  three  major  targets,  we  found  substantial  overlap 


across  the  three  major  SSc  groups.  We  found  peptides 
from  the  autoantigens  of  RuvBLl/2,  which  appear  to  act 
as  general  markers  of  SSc,  with  consistent  detection 
across  all  SSc  groups,  with  almost  no  reactivity  seen  in 
controls.  In  contrast,  some  common  SSc  autoantigens 
such  as  B23  and  Ro52/TRIM21  were  recovered  in  virtu¬ 
ally  all  samples,  and  in  controls,  indicating  an  important 
degree  of  baseline  reactivity  against  some  of  the  more 
common  autoantibody  targets. 

In  this  proof-of-concept  study,  we  do  not  attempt  to 
address  the  clinical  implications  of  the  autoantibody  re¬ 
sponses  described  here  due  to  the  limited  number  of  pa¬ 
tients  analyzed,  nor  are  we  able  to  speculate  on  the 
presence  or  absence  of  these  autoantibodies  in  other  SSc 
autoantibody  groups.  Our  depth  in  this  study  comes  from 
the  number  of  potential  antigens  analyzed,  which  cover 
the  full  proteome.  Future  studies  examining  a  much  larger 
cohort  of  SSc  patients,  along  with  representatives  of  other 
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Fig.  3  Validation  of  RNA  processing  bodies  {PB)/ stress  granules  (SG)  as  a  target  of  the  SSc  autoimmune  response,  a  HeLa  cell  lysates 
were  immunoprecipitated  using  patient  sera,  resolved  by  SDS-PAGE,  and  probed  with  antibodies  targeting  known  PB  and  SG  proteins; 
HeLa  whole  cell  lysate  was  used  as  a  control,  b  Immunofluorescence  was  performed  in  U20S  cells  treated  with  sodium  (meta)arsenite 
to  induce  the  formation  of  SG.  Cells  were  then  fixed  with  4  %  paraformaldehyde  and  permeabilized  with  5  %  normal  horse  serum 
and  0.1  %  digitonin  in  Tris-buffered  saline.  Staining  was  performed  with  anti-elF3b  (SG  marker),  anti-SKI -Hedls  (PB  marker),  and  patient 
sera.  Representative  images  depicting  co-localization  between  patient  sera  and  SG/PB  markers  are  shown,  with  sites  of  co-localization 
circled  in  red.  RNAP  RNA  polymerase,  CENP  centromere  protein  TOPI  topoisomerase  I 


autoimmune  diseases,  will  be  necessary  to  determine  the 
clinical  value  of  these  potential  autoantibodies. 

This  is  not  the  first  study  to  suggest  the  presence  of 
multiple  autoantibodies  in  SSc.  Immunoassays  per¬ 
formed  by  Op  De  Beeck,  et  al.  revealed  the  presence  of 
multiple  autoantibodies  in  a  small  subset  of  SSc  patients 
[28].  A  similar  analysis  by  Graf  et  al.  using  the  EURO¬ 
LINE  immunoassay  revealed  the  presence  of  multiple 
autoantibodies  in  11  %  of  patients  [1]. 

Autoantibodies  against  extracellular  immune  signaling 
receptors  and  extracellular  matrix  proteins  were  con¬ 
spicuously  absent  in  these  data;  this  includes  the  absence 
of  numerous  autoantibodies  previously  implicated  in  SSc 
pathogenesis,  such  as  anti-fibrillin  1,  anti-MMP,  and 
anti-PDGFR  [29].  Additional  analyses  in  other  cell  types, 
such  as  fibroblasts  or  endothelial  cells,  and  cells  main¬ 
tained  under  physiologically  relevant  growth  conditions, 
such  as  immune  activation  or  oxidative  stress,  may  be 


useful  for  identifying  other  proteins  and  complexes 
which  may  play  a  role  in  disease  pathogenesis. 

In  addition  to  identifying  novel  autoantibody  targets, 
the  unbiased  nature  of  mass  spectrometry  provides  add¬ 
itional  insights  into  the  processes  potentially  underlying 
autoimmunity.  The  preferential  detection  of  proteins  as¬ 
sociated  with  RNA  processing  and  oxidative  stress  as  a 
general  feature  of  SSc  autoantibodies  may  be  indicative 
of  their  origins.  Combined  with  the  consistent  targeting 
of  PB/SG  described  here  spanning  all  SSc  patients,  these 
data  suggest  a  basic  model  in  which  disease-specific 
pathologies  give  rise  to  specific  autoantibodies.  Strong 
induction  of  SGs  is  observed  in  response  to  cellular 
stresses,  including  oxidative  stress  and  ischemia,  two 
well-established  phenomena  in  SSc  [27].  SG/PB  are  also 
readily  induced  in  response  to  the  tumor  microenviron¬ 
ment,  consistent  with  recent  evidence  linking  RNAP3- 
positive  SSc  and  cancer  [5,  30].  Combined  with  evidence 
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linking  transforming  growth  factor  (TGF)-|3  signaling 
with  an  increase  in  PB  formation  [31],  many  of  the 
major  processes  underlying  SSc  pathogenesis  appear 
broadly  consistent  with  an  immune  response  against 
cells  undergoing  a  stress  response.  PBs  are  also  known 
to  associate  with  other  cytoplasmic  structures,  such  as  U 
bodies  [32],  which  house  an  number  of  well-established 
SSc  autoantibody  targets,  including  Ul,  U5,  and  U11/U12. 
Taken  together,  these  data  suggest  a  model  in  which  auto¬ 
antibodies  arise  as  a  secondary  phenotype  in  response  to 
SSc-related  processes  already  underway. 

Some  evidence  suggesting  a  link  between  PBs  and 
autoimmunity  has  been  described  previously.  Bhanji,  et 
al.  observed  reactivity  against  a  number  of  PB-associated 
proteins,  including  GEl/Hedls ,  GW182,  and  Ago2  in  a 
range  of  autoimmune  diseases,  including  SSc  [33].  Among 
these  autoantigens,  only  Ago2  was  also  identified  in  this 
analysis,  suggesting  a  persistent,  yet  diverse  response 
against  this  complex.  A  similar  analysis  of  SG-associated 
proteins  has  not  been  performed;  however  given  the  re¬ 
activity  to  PB  proteins  seen  in  other  autoimmune  diseases, 
reactivity  in  other  autoimmune  diseases  is  possible.  A  de¬ 
tailed  analysis  using  both  mass  spectrometry  and  other 
methods  will  be  necessary  to  understand  the  degree  to 
which  autoantibodies  to  SG/PBs  are  seen  in  other  auto¬ 
immune  diseases,  and  the  clinical  implications  of  these 
findings. 

This  work  has  several  limitations.  First,  we  cannot 
eliminate  the  possibility  that  some  proteins  found  in  our 
mass  spectrometry  data  result  from  co-IP  of  multi¬ 
protein  complexes  by  a  single  autoantibody;  however,  we 
were  able  to  confirm  the  presence  of  multiple  PB/SG 
autoantibodies  by  other  means  (Fig.  3).  We  also  cannot 
rule  out  the  possibility  that  some  targets  were  missed 
due  to  their  being  sequestered  into  tightly  packed 
molecular  complexes  associated  with  chromatin.  For 
example,  the  presence  of  CENP  autoantibodies  within 
these  samples  had  been  established  using  clinical  methods, 
indicating  its  absence  in  our  mass  spectrometry  data  is 
likely  a  result  of  its  sequestration  into  large  macromolecu- 
lar  complexes  with  limited  solubility.  A  lack  of  age-  and 
gender- matched  controls  likely  underestimates  the  degree 
of  baseline  reactivity  seen  in  unaffected  controls.  Finally, 
the  small  number  of  patient  samples  used  in  this  study 
prevents  any  clinical  interpretation,  and  the  variability  in 
the  number  of  peptides  recovered  between  experiments 
limits  direct  quantitative  comparisons  between  autoanti¬ 
body  groups. 

Conclusions 

The  data  presented  here  provide  evidence  of  diverse  im¬ 
mune  reactivities  in  SSc  targeting  a  wide  array  of  protein 
complexes.  Among  these  complexes,  autoantibodies  tar¬ 
geting  PB/SG  were  consistently  identified  across  both 


clinical  SSc  subsets  and  major  autoantibody  groups,  sug¬ 
gesting  a  potential  novel  autoantibody  target.  Taken  to¬ 
gether,  these  data  suggest  immune  responses  to  proteins 
involved  in  cellular  stress  may  be  a  common  mechanism 
for  autoantibody  generation. 
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Additional  file  1:  Table  SI.  Complete  list  of  peptides  identified  in  this 
analysis.  TP  number  of  total  peptides  mapping  to  a  protein,  UP  number 
of  unique  peptides  mapping  to  a  protein,  UM  number  of  non-redundant 
peptides  mapping  exclusively  to  a  protein,  MW  molecular  weight,  Length 
protein  length  in  amino  acids.  (XLS  639  kb) 

Additional  file  2:  Table  S2.  Gene  lists  used  in  these  analyses. 

(XLSX  17  kb) 

Additional  file  3:  Table  S3.  Systemic  sclerosis  (SSc)-specific  enrichment 
of  processes  and  components.  Proteins  differentially  detected  in  SSc 
were  analyzed  using  gProfiler.  Statistically  significant  processes  and 
components  are  shown.  A)  Peptides  detected  at  any  level  across  all 
four  groups.  B)  Peptides  identified  in  all  SSc  groups,  but  absent  in 
controls.  C)  Analysis  of  137  proteins  differentially  detected  in  SSc.  BP 
biological  process,  CC  cellular  component,  MF  molecular  function,  ke 
KEGG  pathway,  re  REACTOME  pathway.  (ODS  70  kb) 

Additional  file  4:  Figure  SI.  Network  analysis  of  systemic  sclerosis 
(SSc)  autoantigens.  All  763  non-redundant  peptide  hits  identified  in  two 
or  more  patients  were  analyzed  using  the  Genome-scale  Integrated 
Analysis  of  gene  Networks  in  Tissues  (GIANT)  global  network  to  identify 
functionally  associated  protein  networks.  Analysis  of  community  function 
was  performed  using  gProfiler.  SSc-associated  autoantibodies  are  highlighted 
in  yellow.  (EPS  1902  kb) 

Additional  file  5:  Table  S4.  Processing  body  and  stress  granule 
proteins  identified  in  this  analysis.  *Proteins  with  multiple  subunits.  Data 
indicate  non-redundant  peptide  hits.  (ODS  20  kb) 
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